TL;DR

Language Engineer (AI): Developing diverse datasets to train and evaluate Amazon AI models, with an emphasis on synthetic data generation, model-supported data generation, and human-in-the-loop data collections. Focus on designing data collections, analyzing data, and building tools for data analysis or creation.

Location: USA, WA, Bellevue; USA, MA, Boston; USA, CA, Sunnyvale

Salary: USA, CA, Sunnyvale - 86,500.00 - 151,400.00 USD annually; USA, MA, Boston - 75,200.00 - 131,600.00 USD annually; USA, WA, Bellevue - 82,700.00 - 131,600.00 USD annually

Company

Amazon strives to be the world’s most customer-centric company, where customers can research and purchase anything they might want online or offline.

What you will do

  • Design complex data collections with human participants.
  • Design and conduct complex data creation tasks using synthetic and model-based data generation methods.
  • Analyze and extract insights from large amounts of data.
  • Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language.
  • Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models.

Requirements

  • Experience owning and executing language data collection projects, including development of guidelines, label sets, and annotation workflows.
  • Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis).
  • 2+ years of experience in computational linguistics, language data processing, or AI data creation.
  • Experience with language data annotation systems and other forms of data markup.
  • Proficient with scripting languages, such as Python.
  • Experience working with speech, text, and multimodal data in multiple languages.
  • Excellent communication, strong organizational skills, and close attention to detail.
  • Comfortable working in a fast-paced, highly collaborative, dynamic work environment.

Nice to have

  • PhD in Computational Linguistics (or equivalent field with computational emphasis).
  • Expertise in bootstrapping AI data collections for quickly evolving requirements.
  • Extensive experience working with speech, text, and multimodal data in multiple languages.
  • Experience in data creation for complex agentic workflows.
  • Practical experience with Machine Learning and technical concepts such as APIs.
  • Practical knowledge of version control and agile development; familiarity with database queries and data analysis processes (SQL, R, Matlab, etc.).

Culture & Benefits

  • Amazon offers comprehensive benefits, including health insurance (medical, dental, vision, prescription), Basic Life & AD&D insurance with an option for supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, and Adoption and Surrogacy Reimbursement coverage.
  • 401(k) matching.
  • Paid time off and parental leave.