Web Codegen Scorer evaluates the quality of frontend code produced by large language models. It provides a definitive way to determine whether AI-generated HTML, CSS, or JavaScript meets production standards or requires significant refactoring. By selecting a specific model, framework, and tooling, you can run automated checks in a test environment that mirrors your actual development setup through system instructions and MCP server integration.
The tool focuses on high-impact metrics: build success, runtime exceptions, accessibility (a11y) compliance, and security vulnerabilities. It also assigns an LLM-based quality grade and flags departures from established coding best practices. If a check fails, the scorer attempts an automated patch, providing a potential fix rather than just a failure report.
Flexible Configuration Compare performance across various models, frontend frameworks, and build pipelines.
Comprehensive Testing Built-in validation for build success, runtime stability, accessibility standards, and security hygiene.
Automated Repairs The system attempts to fix generated code automatically when errors are detected.
Visual Reporting Dashboards allow for side-by-side comparisons of different runs to identify exactly where specific models underperform.
npm install -g web-codegen-scorer
# Export the keys required for your chosen models
export GEMINI_API_KEY="your-key"
export OPENAI_API_KEY="your-key"
export ANTHROPIC_API_KEY="your-key"
Test the tool using the included Angular example.
web-codegen-scorer eval --env=angular-example
web-codegen-scorer init
• --env=<path> — Path to the environment configuration. (Required)
• --model=<name> — Specifies the LLM to be evaluated.
• --local — Bypasses the API to run scoring against previously generated code.
• --limit=<number> — Limits the evaluation to a specific number of prompts.
• --output-directory=<name> — Specifies the directory where results are saved.
• --concurrency=<number> — Limits the number of parallel API requests.
• --report-name=<name> — Sets a custom title for the generated report.
Dianman VPN: Free Trial, Unlimited Data & Zero Throttling
Open Source 3D Tetris in Your Browser With React and Three.js
Open Deep Research: Customizable AI Agents for Automated Report Generation
RunAgent: Build AI Agents in Python, Invoke Them Natively from Any Language
Common Ground: Multi-Agent Collaboration That Actually Works
Clueless: A Native AI Meeting Assistant for Mac with Live Transcription
Memos Self-Hosted Note App: Lightweight Markdown and API-First
Deploying AI Manus: Docker Compose Setup & Development Guide
Natural Language CAD Control via CAD-MCP Server
DeepWiki: Automatically Generate Interactive Wikis for Any GitHub Repository
Spacedrive: An Open-Source Cross-Platform File Manager
LiveTerm: A Next.js Terminal-Style Website Template