Skip to main content
100 years and 10 days since the five-day weekRead the story
Posted 3 days ago

Video Caption Specialist

Part timeRemote · USA

Pay: $25-$25 per hour (USD).

Job Title: Video Caption Specialist

Job Type: Contractor

Location: Remote

Job Summary

Join our customer as a Video Caption Specialist, where you'll apply your expertise to help train next-generation AI systems. Your work will shape how models learn, reason, and perform through high-quality, real-world input. No prior experience in AI is required — your domain knowledge is what matters.

Key Responsibilities

  1. Review short (5-second) videos of robots performing a variety of physical tasks while closely analyzing the corresponding AI-generated captions.
  2. Detect visual-text discrepancies, identifying any hallucinations (errors) or omissions in the captions compared to the real video content.
  3. Rewrite captions with clear, concise, and grammatically correct language, ensuring high accuracy in describing robot actions and motions.
  4. Emphasize the precision of robot movement and task execution in every caption, as these details are critical for model training.
  5. Maintain strict consistency with established project guidelines and rubrics, applying strong written judgment and editing standards.
  6. Meet or exceed daily throughput and quality benchmarks to ensure timely and reliable project delivery.
  7. Collaborate with the customer’s team, providing actionable feedback and sharing best practices to continually enhance caption accuracy.

Required Skills and Qualifications

  1. Fluency in English with excellent grammar, spelling, and written clarity.
  2. Keen attention to detail and demonstrated ability to spot subtle errors or inaccuracies.
  3. Comfortable analyzing and comparing short video clips and their written descriptions repeatedly.
  4. Strong judgment for rewriting and editing text for maximum accuracy and clarity.
  5. Ability to adhere to detailed instructions and maintain consistency across structured, repetitive review work.
  6. Reliable internet connection and a computer capable of smooth video streaming.
  7. Excellent communication skills, both written and verbal, with a strong care for clarity and accuracy.

Preferred Qualifications

  1. Experience in AI training data annotation, RLHF, or LLM evaluation.
  2. Background in writing, editing, journalism, technical writing, transcription, or copy editing.
  3. Familiarity with robotics, computer vision, or video annotation workflows.