GitShow/sveltejs/ai
sveltejs

ai

All things AI related...current home of the svelte LLM benchmark

by sveltejs
Star on GitHubForkWebsitenpm

TypeScript

38 stars4 forks2 contributorsQuiet · 4mo agoSince 2025

Meet the team

See all 2 on GitHub →
khromov
khromov188 contributions
paoloricciuti
paoloricciuti20 contributions

Languages

View on GitHub →
TypeScript88.7%
Svelte10.9%
JavaScript0.3%
Shell0.1%

Commit activity

Last 12 weeks · 0 commits

Full graph →

Community health

1 of 6 standards met

Community profile →
50
✓README○License○Contributing○Code of Conduct○Issue Template○PR Template

Recent PRs & issues

Quiet · 4 in progress · Last activity 4mo ago
See all on GitHub →
paoloricciuti
feat: parallel testingOpenPR

This allows for parallel testing (so that it doesn't take 20 minutes per model to run the suite 😄)

paoloricciuti · 5mo ago
MrVoshel
[Test Proposal]: Ability to correctly use reactive non-POJOsOpenIssue

Test Name svelte-reactivity Test Description It would be nice to test the ability of using purposefully non-deeply reactive values correctly, avoid common gotchas and avoid letting LLMs reimplement svelte/reactivity functionality. For example: https://svelte.dev/playground/6133806fa4ff4ce09e2f0d393e1d6084?version=5.46.4 Usage of createSubscriber too -- but I can't think of a way of testing a generalized approach that checks footguns like creating useless subscribers or a new one on every "get", etc. (Not in the proposed prompt) Skills would help a lot here, but interested in seeing their "native" approach. Proposed prompt was generated by Gemini 3 Flash, but changed most of it since it was too extensive. Even now it's not very generic. Proposed Prompt Component Task Create a dashboard that tracks multiple counters using different data structures. The goal is to demonstrate the nuances of Svelte 5's deep reactivity, specifically focusing on why values require explicit wrapping compared to standard POJOs. Requirements Deep Reactivity (POJO): Show a standard nested object where properties are automatically reactive (e.g., ). Correct SvelteMap Usage: Implement a SvelteMap where values are explicitly wrapped in before being stored, ensuring deep reactivity works as expected. Safe Lifecycle**: Ensure a proper cleanup function is returned. Reference Implementation (optional) Additional Context _No response_

MrVoshel · 5mo ago
paoloricciuti
Feature Request: more nuanced score for modelsOpenIssue

Right now, the models are scored solely based on the amount of test that they pass...a more nuanced score that also involves how the tests are passing would be wonderful. This could involve: Whether the model is using the MCP server or not Whether the model is using the Test tool or not The amount of step it took to complete The number of tokens it took to complete Possibly cost (?) Other ideas?

paoloricciuti · 5mo ago

Recent fixes

View closed PRs →
paoloricciuti
feat: skill and subagent to write test plus a bunch of testsMergedPR

I asked Claude to find a bunch of plausible prompt for the users to ask to the llm and then asked claude to implement them as test scenarios with the skill + subagent. I just pushed everything as I have to run so all the test must be verified before merging.

paoloricciuti · 5mo ago
paoloricciuti
feat: highlight result with `shiki`MergedPR

This is to show the generated code as highlighted in the html report

paoloricciuti · 5mo ago
khromov
Add LM Studio providerMergedPR

I know we said that we can add just use AI SDK, but if we want to test fine tunes we can't use that, so I still think it's good to have a way to do this as LM Studio can run anything from HF.

khromov · 6mo ago
Structured data for AI agents

Repository: sveltejs/ai. Description: All things AI related...current home of the svelte LLM benchmark Stars: 38, Forks: 4. Primary language: TypeScript. Languages: TypeScript (88.7%), Svelte (10.9%), JavaScript (0.3%), Shell (0.1%). Homepage: https://sveltejs.github.io/ai/ Open PRs: 4, open issues: 3. Last activity: 4mo ago. Community health: 50%. Top contributors: khromov, paoloricciuti.

·@ofershap

Replace github.com with gitshow.dev