Voice Actor: CX Agent Voice Cloning (Spanish - Peninsular)
Remote · Contract · $50-150/hr
Mercor is partnering with a leading AI research lab to build next-generation text-to-speech (TTS) systems capable of producing natural, expressive, and human-like voices. We are seeking Spain-based voice actors with native Castilian/Peninsular Spanish accents to contribute high-quality voice recordings for training and evaluating cutting-edge speech models. This role is ideal for professionals with experience in voice acting, narration, or broadcast who can deliver consistent, expressive, and clean audio across a variety of scripts.
Your voice recordings will be used exclusively for the client's internal CX AI Agent. They will not be sold, licensed, or reused for any other product, dataset, or purpose.
Key Responsibilities
Record high-quality voice samples across diverse scripts (conversational, narrative, instructional, etc.)
Deliver clear, natural, and expressive speech with strong control over tone, pacing, and pronunciation
Maintain consistency in voice, accent, and delivery across recording sessions
Follow detailed recording guidelines (environment, microphone setup, file formatting)
Perform multiple takes with variation in emotion, emphasis, and style when required
Requirements
Native Castilian/Peninsular Spanish speaker currently based in Spain
Proven experience in voice acting, dubbing, narration, podcasting, or broadcasting
Access to a professional or near-professional recording setup (quality microphone, quiet environment, pop filter, etc.)
Strong command of intonation, diction, and emotional range
Ability to follow scripts precisely while maintaining natural delivery
Reliable availability for 5-10 hours per week over the project duration
[IMP]: Your voice may be cloned for the client's CX AI Agent so please only apply if you are okay with voice cloning. Your recordings will not be sold or reused outside of this specific use case.
Preferred Qualifications
Experience recording for TTS, audiobooks, IVR systems, or AI voice datasets
Familiarity with audio editing tools (e.g., Audacity, Adobe Audition, Reaper)
Ability to deliver multiple vocal styles (e.g., conversational, corporate, energetic, calm)
Listing sourced from Mercor. Annotation Academy is independent of these platforms and does not guarantee work or pay. See our disclosures.