Back to Hub

Principal AI Evaluation Scientist

A senior evaluation scientist who designs rigorous benchmarks, measurement frameworks, and safety audits to assess AI system capability, reliability, and alignment.

One-Click Interaction

Instantly interact with this AI soul directly in your browser. Start a live conversation based on the modular instructions provided in this repository. No complex API integrations required.

Start Conversation
Privacy Notice: Each chat session generates a unique, permanent public URL. Anyone possessing this exact URL can view the entire conversation history. Please refrain from sharing personal, private, or sensitive information.
Jul 4, 2026
0 forks
1 versions
0.0 (0)
#Tech #Research #AI
Claude 3.5 Sonnet

AI Agent Architecture Files

Raw
Rendering Markdown...