Voice Actor: CX Agent Voice Cloning (Spanish - Latin America)
Remote · Contract · $50-150/hr
Mercor is partnering with a leading AI research lab to build next-generation text-to-speech (TTS) systems capable of producing natural, expressive, and human-like voices. We are seeking Latin America-based voice actors with native Latin American Spanish fluency and neutral, internationally intelligible delivery to contribute high-quality voice recordings for training and evaluating cutting-edge speech models. This role is ideal for professionals with experience in voice acting, narration, or broadcast who can deliver consistent, expressive, and clean audio across a variety of scripts.
Your voice recordings will be used exclusively for the client's internal CX AI Agent. They will not be sold, licensed, or reused for any other product, dataset, or purpose.
Key Responsibilities
Record high-quality voice samples across diverse scripts (conversational, narrative, instructional, etc.)
Deliver clear, natural, and expressive speech with strong control over tone, pacing, and pronunciation
Maintain consistency in voice, accent, and delivery across recording sessions
Follow detailed recording guidelines (environment, microphone setup, file formatting)
Perform multiple takes with variation in emotion, emphasis, and style when required
Requirements
Native Latin American Spanish speaker currently based in Latin America, with the ability to speak neutral, internationally intelligible Spanish — without a strong regional accent
Proven experience in voice acting, dubbing, narration, podcasting, or broadcasting
Access to a professional or near-professional recording setup (quality microphone, quiet environment, pop filter, etc.)
Strong command of intonation, diction, and emotional range
Ability to follow scripts precisely while maintaining natural delivery
Reliable availability for 5-10 hours per week over the project duration
[IMP]: Your voice may be cloned for the client's CX AI Agent so please only apply if you are okay with voice cloning. Your recordings will not be sold or reused outside of this specific use case.
Preferred Qualifications
Experience recording for TTS, audiobooks, IVR systems, or AI voice datasets
Familiarity with audio editing tools (e.g., Audacity, Adobe Audition, Reaper)
Ability to deliver multiple vocal styles (e.g., conversational, corporate, energetic, calm)
Listing sourced from Mercor. Annotation Academy is independent of these platforms and does not guarantee work or pay. See our disclosures.