Last 12 weeks · 177 commits
1 of 6 standards met
Repository: sveltejs/ai. Description: All things AI related...current home of the svelte LLM benchmark Stars: 37, Forks: 2. Primary language: TypeScript. Languages: TypeScript (88.7%), Svelte (10.9%), JavaScript (0.3%), Shell (0.1%). Homepage: https://sveltejs.github.io/ai/ Open PRs: 3, open issues: 3. Last activity: 4w ago. Community health: 50%. Top contributors: khromov, paoloricciuti.
Test Name: svelte-reactivity

Test Description: It would be nice to test the ability to use purposefully non-deeply-reactive values correctly, avoid common gotchas, and avoid letting LLMs reimplement svelte/reactivity functionality. For example: https://svelte.dev/playground/6133806fa4ff4ce09e2f0d393e1d6084?version=5.46.4

Usage of createSubscriber too -- but I can't think of a way of testing a generalized approach that checks for footguns like creating useless subscribers, or creating a new one on every "get", etc. (Not in the proposed prompt.) Skills would help a lot here, but I'm interested in seeing their "native" approach.

The proposed prompt was generated by Gemini 3 Flash, but I changed most of it since it was too extensive. Even now it's not very generic.

Proposed Prompt:

Component Task: Create a dashboard that tracks multiple counters using different data structures. The goal is to demonstrate the nuances of Svelte 5's deep reactivity, specifically focusing on why values require explicit wrapping compared to standard POJOs.

Requirements:
- Deep Reactivity (POJO): Show a standard nested object where properties are automatically reactive (e.g., ).
- Correct SvelteMap Usage: Implement a SvelteMap where values are explicitly wrapped in before being stored, ensuring deep reactivity works as expected.
- Safe Lifecycle: Ensure a proper cleanup function is returned.

Reference Implementation: (optional)

Additional Context: _No response_
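A minimal sketch of what a reference implementation for the three requirements might look like. It assumes Svelte 5 runes (`$state`, `$effect`) and `SvelteMap` from `svelte/reactivity`; the counter names, interval, and markup are illustrative only, not part of the proposed prompt:

```svelte
<script>
	import { SvelteMap } from 'svelte/reactivity';

	// 1. POJO: $state proxies nested properties, so they are deeply reactive.
	let pojo = $state({ nested: { count: 0 } });

	// 2. SvelteMap: the map's structure is reactive, but plain-object values
	// are not -- wrap each value in $state before storing it.
	const counters = new SvelteMap();
	function addCounter(key) {
		const entry = $state({ count: 0 });
		counters.set(key, entry);
	}
	addCounter('clicks');

	// 3. Safe lifecycle: the effect returns a cleanup function.
	$effect(() => {
		const id = setInterval(() => pojo.nested.count++, 1000);
		return () => clearInterval(id);
	});
</script>

<button onclick={() => counters.get('clicks').count++}>
	clicks: {counters.get('clicks').count}
</button>
<p>POJO: {pojo.nested.count}</p>
```

Storing the `$state`-wrapped entry (rather than a plain object) is what makes `counters.get('clicks').count++` update the template; this is exactly the gotcha the test is meant to probe.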
Right now, the models are scored solely on the number of tests they pass...a more nuanced score that also reflects how the tests are passed would be wonderful. This could involve:
- Whether the model is using the MCP server or not
- Whether the model is using the Test tool or not
- The number of steps it took to complete
- The number of tokens it took to complete
- Possibly cost (?)
- Other ideas?
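One way the ideas above could be combined is a weighted score: pass rate dominates, tool usage earns a small bonus, and steps/tokens apply a mild efficiency penalty. The metric names, weights, and thresholds below are entirely hypothetical, not the benchmark's actual schema:

```typescript
// Hypothetical per-run metrics; field names are illustrative.
interface RunMetrics {
  testsPassed: number;
  testsTotal: number;
  usedMcpServer: boolean;
  usedTestTool: boolean;
  steps: number;
  tokens: number;
}

// Pass rate carries most of the weight; tool usage adds a small bonus;
// steps and tokens reduce an efficiency term, clamped at zero.
function nuancedScore(m: RunMetrics): number {
  const passRate = m.testsTotal > 0 ? m.testsPassed / m.testsTotal : 0;
  const toolBonus = (m.usedMcpServer ? 0.05 : 0) + (m.usedTestTool ? 0.05 : 0);
  const efficiency = Math.max(0, 1 - m.steps / 100 - m.tokens / 1_000_000);
  return passRate * 0.8 + toolBonus + efficiency * 0.1;
}

console.log(
  nuancedScore({
    testsPassed: 9,
    testsTotal: 10,
    usedMcpServer: true,
    usedTestTool: false,
    steps: 20,
    tokens: 50_000,
  }).toFixed(3)
);
```

Cost could slot in the same way as tokens, as another penalty term, once runs record it.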
This is to show the generated code as syntax-highlighted in the HTML report.