We are seeking an AI Engineer (Data) to join a healthcare technology organization undergoing a major modernization of its legacy platforms and data infrastructure. This role will focus on building scalable data pipelines, enabling machine learning systems, and helping integrate AI capabilities into modern healthcare applications.
You will work closely with data engineers, software engineers, and product teams to transform legacy systems into a modern AI-enabled data platform supporting analytics, automation, and intelligent healthcare workflows.
This is a hands-on engineering role focused on data pipelines, ML infrastructure, and scalable AI systems in a cloud-native environment.
Responsibilities
- Design and build scalable data pipelines that support machine learning and AI-driven applications
- Develop and maintain ML-ready data infrastructure across structured and unstructured healthcare datasets.
- Implement data ingestion, transformation, and feature engineering pipelines.
- Build services that expose ML models and AI capabilities to internal applications and APIs
- Work with engineering teams to integrate AI functionality into modernized healthcare platforms.
- Help migrate legacy data architectures to cloud-native data platforms
- Implement best practices for data governance, data quality, and observability
- Collaborate with product and engineering teams to deliver AI-powered healthcare solutions
- Contribute to the development of ML pipelines, model deployment workflows, and AI platform tooling
Required Experience
- 5+ years of experience in data engineering, machine learning infrastructure, or AI platform engineering
- Strong experience building data pipelines and distributed data systems
- Proficiency in Python for data processing and ML workflows
- Experience working with cloud platforms (AWS, GCP, or Azure)
- Experience with data processing frameworks (Spark, Airflow, or similar)
- Familiarity with ML lifecycle tools and model deployment workflows
- Experience working with large-scale structured and unstructured datasets
- Understanding of API-based architectures and microservices
- Strong problem-solving skills and the ability to work in modern distributed systems
Preferred Experience
- Experience working with healthcare data platforms or regulated environments
- Exposure to LLMs, NLP pipelines, or AI-driven applications
- Experience with feature stores, vector databases, or ML platforms
- Familiarity with modern data lake/lakehouse architectures
- Experience integrating AI services into production applications
- Experience working in modernization or platform transformation initiatives
