
Arxiv Agentic Verifier

Actively verifies Python/JS code correctness by generating targeted test cases that expose logic flaws based on problem constraints.

Downloads: 333
Stars: 0
Version: 1.0.0
Category: Developer Tools
Security check: passed

Skill Description

ArXiv Agentic Verifier

Source Paper: Scaling Agentic Verifier for Competitive Coding (ID: 4a4c4dae6a5145ebc4d62eb2d64b0f0f)
Type: Code Verification / Test Generation

Description

This skill implements an "Agentic Verifier" that actively reasons about code correctness by generating targeted, "discriminative" test cases, that is, inputs chosen to separate correct behavior from incorrect behavior. Rather than sampling inputs at random, it analyzes the problem constraints and the code's logic to target edge cases and logic flaws.

Features

  • Analyze Code: Reads the logic of Python or JavaScript code.
  • Generate Tests: Creates specific inputs intended to break the code.
  • Execute & Verify: Runs the code against generated tests (sandbox recommended for production).
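
The generate-then-verify idea in the list above can be illustrated with a minimal sketch. It is not the skill's actual implementation: `findDiscriminatingInputs`, `reference`, and `candidate` are hypothetical names, the test inputs are hand-written rather than produced by an LLM, and the assumption that inputs may exceed 2^53 is made up for the example. The point is only that a targeted input can expose a flaw that typical small inputs miss.

```javascript
// Hypothetical helper (not part of the skill): returns every input on which
// the candidate disagrees with a trusted reference, or throws.
function findDiscriminatingInputs(candidate, reference, inputs) {
  const failures = [];
  for (const input of inputs) {
    const expected = reference(input);
    let actual;
    try {
      actual = candidate(input);
    } catch (err) {
      actual = `error: ${err.message}`;
    }
    if (actual !== expected) {
      failures.push({ input, expected, actual });
    }
  }
  return failures;
}

// Trusted reference for "output the sum of A and B", using exact BigInt math.
const reference = (input) => {
  const [a, b] = input.trim().split(/\s+/).map(BigInt);
  return String(a + b);
};

// Candidate with a hidden flaw: Number loses precision above 2^53.
const candidate = (input) => {
  const [a, b] = input.trim().split(/\s+/).map(Number);
  return String(a + b);
};

// The first three inputs are "ordinary"; the last one targets the assumed
// constraint that values may exceed the safe integer range.
const inputs = ['1 2', '-5 5', '0 0', '9007199254740993 2'];
const failures = findDiscriminatingInputs(candidate, reference, inputs);
console.log(failures);
```

Only the last input distinguishes the two implementations, which is what makes it discriminative in the sense used here.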

Usage

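// Load the verifier class exported by this skill's entry point.
const AgenticVerifier = require('./index');
// The constructor takes the OpenAI API key used for LLM reasoning.
const verifier = new AgenticVerifier(process.env.OPENAI_API_KEY);

const problem = "Given two integers A and B, output their sum.";
// Python solution under test. As written, it calls input() twice and reads
// index 1 of the second line, so it fails on a single-line "A B" input.
const code = "print(int(input().split()[0]) + int(input().split()[1]))";

// verify() returns a Promise; log the result, or the error if it rejects.
verifier.verify(problem, code, 'python')
  .then(result => console.log(result))
  .catch(err => console.error(err));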

Configuration

  • OPENAI_API_KEY: Required for LLM reasoning.
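
Since the key is required, a caller may want to fail fast before constructing the verifier. The following is a small sketch of such a check; `readApiKey` is a hypothetical helper and is not part of the skill.

```javascript
// Hypothetical helper (not part of the skill): read OPENAI_API_KEY from an
// environment object and throw a clear error if it is missing or blank.
function readApiKey(env = process.env) {
  const key = env.OPENAI_API_KEY;
  if (typeof key !== 'string' || key.trim() === '') {
    throw new Error('OPENAI_API_KEY is not set; it is required for LLM reasoning.');
  }
  return key.trim();
}
```

The returned value could then be passed to the constructor in place of reading `process.env.OPENAI_API_KEY` directly.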

Security Warning

This skill executes code provided to it. Use in a restricted environment or sandbox.
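
As a rough illustration of the minimum precautions, the sketch below runs untrusted JavaScript in a separate Node.js process with a timeout, an output cap, and the API key removed from the child's environment. `runIsolated` is a hypothetical helper, not the skill's own runner, and a child process is not a real sandbox: the code can still touch the file system and network, so a container or VM is still advisable.

```javascript
const { spawnSync } = require('child_process');

// Hypothetical helper (not part of the skill): run a JavaScript snippet in a
// child Node.js process, feeding it `stdin` and killing it after `timeoutMs`.
function runIsolated(code, stdin, timeoutMs = 2000) {
  // Copy the environment but drop the API key so the child cannot read it.
  const env = { ...process.env };
  delete env.OPENAI_API_KEY;

  const result = spawnSync(process.execPath, ['-e', code], {
    input: stdin,            // written to the child's stdin
    encoding: 'utf8',        // return stdout/stderr as strings
    timeout: timeoutMs,      // kill the child if it runs too long
    maxBuffer: 1024 * 1024,  // cap captured output at 1 MiB
    env,
  });

  return {
    ok: result.status === 0,
    // Set when the child was killed by a signal or could not finish normally.
    killed: result.signal !== null || Boolean(result.error),
    stdout: result.stdout || '',
    stderr: result.stderr || '',
  };
}

// Example: a snippet that reads "A B" from stdin and prints the sum.
const sumSnippet =
  "const s = require('fs').readFileSync(0, 'utf8');" +
  "const [a, b] = s.trim().split(/\\s+/).map(Number);" +
  "console.log(a + b);";
const sample = runIsolated(sumSnippet, '2 3\n');
console.log(sample.ok, sample.stdout.trim());
```

The timeout matters most for generated tests, since an input that triggers an infinite loop would otherwise hang the verifier.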

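How do I use "Arxiv Agentic Verifier"?

  1. Open 小龙虾AI (web or iOS app).
  2. Click the "Use now" button above, or type a task description into the chat box.
  3. 小龙虾AI automatically matches the task and invokes the "Arxiv Agentic Verifier" skill to complete it.
  4. Results appear immediately, and you can keep chatting to refine them.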
