We are seeking an analytical, technically minded professional to:
- Evaluate AI outputs and processes
- Ensure quality, accuracy, and reliability
- Identify logical errors, risks, and structural inconsistencies
- Provide actionable insights and recommendations to the team
Ideal candidates:
- Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills
- Professionals curious about AI, process improvement, and quality evaluation
- Problem-solvers who enjoy analyzing complex systems, logic, and scenarios
Key Responsibilities:
- Lead evaluation of AI outputs and related processes
- Review tasks against expected/ideal scenarios; identify gaps and risks
- Provide structured, actionable recommendations to engineers, domain experts, and managers
- Maintain and improve evaluation guidelines, checklists, and SOPs
- Suggest new approaches, tools, and processes to enhance AI evaluation
Experience:
- Scenario validation, data analysis, auditing, or consulting experience
- Analytical work in research, technical/business analysis, or risk evaluation
Knowledge & Skills:
- Strong analytical and critical thinking
- Attention to detail, reliability, and an ownership mindset
- Technical understanding: JSON/YAML, basic Git/GitHub
- Independent, proactive mindset
Nice to Have:
- Scenario-based testing, annotation workflows, AI/LLM evaluation
- Experience in cross-functional teams