Data Engineer
Apply for this position → Go ad-free with PremiumPosition Summary: MedReview Innovation and Development team is seeking a data engineer to function as the primary architect and operator of our data infrastructure. Your mission is to evolve our current environment into a rapid-acquisition engine capable of feeding real-time ML models, innovation, and operations while maintaining rigorous healthcare compliance standards. Responsibilities:
Pipeline Architecture: Design, implement, and maintain end-to-end data pipelines on Azure, ensuring high availability and low latency for healthcare claim and analytics processing.
High-Performance Storage: Manage and optimize ClickHouse as our primary analytical engine, focusing on rapid data ingestion and lightning-fast query performance for large-scale datasets.
ML Data Readiness: Structure data environments to support the full ML lifecycle, from feature engineering and training to real-time model inference.
MLOps Integration: Collaborate with Data Scientists to implement automated CI/CD pipelines for model deployment, monitoring, and retraining.
Rapid Acquisition: Develop scalable frameworks to ingest diverse healthcare data sources (EDI, claims, clinical notes) with high velocity.
Security & Compliance: Ensure all data structures and processes adhere to HITRUST/HIPAA standards, collaborating with IT and the leads for technical efforts for HITRUST certification readiness.
Required Skills & Experience
Cloud Expertise: 5+ years of experience in data engineering, with deep proficiency in Azure Data Factory, Azure Databricks, or Azure Synapse.
OLAP Mastery: Proven experience managing and tuning ClickHouse (or similar columnar databases like Druid/Pinot) for massive datasets.
Programming: Expert-level Python and SQL skills.
ML Engineering: Familiarity with ML frameworks (PyTorch, TensorFlow) and MLOps tools (MLflow, Kubeflow, or Azure Machine Learning).
Healthcare Domain: Prior experience with healthcare data formats (HL7, FHIR, 835/837) and a strong understanding of HITRUST/HIPAA security requirements.
Scale-up Mindset: Ability to build 'v1' processes while designing for 10x growth.
Preferred Qualifications:
Experience with Infrastructure as Code (Terraform, Bicep).
Knowledge of stream processing (Kafka, Azure Event Hubs).
Background in financial or payment integrity analytics.
Salary: 105,000 - 115,000
Similar Jobs
Principal Machine Learning & Data Engineer
Twilio · USA
Staff Fullstack Software Engineer, Core Performance
Dropbox · USA
Senior Forward Deployed Software Engineer
H1 · USA
Software Engineer L5, Python Platform
Netflix · USA
Sr. Machine Learning Engineer I
HubSpot · USA
Data Engineer
Position Summary: MedReview Innovation and Development team is seeking a data engineer to function as the primary architect and operator of our data infrastructure. Your mission is to evolve our current environment into a rapid-acquisition engine capable of feeding real-time ML models, innovation, and operations while maintaining rigorous healthcare compliance standards. Responsibilities:
Pipeline Architecture: Design, implement, and maintain end-to-end data pipelines on Azure, ensuring high availability and low latency for healthcare claim and analytics processing.
High-Performance Storage: Manage and optimize ClickHouse as our primary analytical engine, focusing on rapid data ingestion and lightning-fast query performance for large-scale datasets.
ML Data Readiness: Structure data environments to support the full ML lifecycle, from feature engineering and training to real-time model inference.
MLOps Integration: Collaborate with Data Scientists to implement automated CI/CD pipelines for model deployment, monitoring, and retraining.
Rapid Acquisition: Develop scalable frameworks to ingest diverse healthcare data sources (EDI, claims, clinical notes) with high velocity.
Security & Compliance: Ensure all data structures and processes adhere to HITRUST/HIPAA standards, collaborating with IT and the leads for technical efforts for HITRUST certification readiness.
Required Skills & Experience
Cloud Expertise: 5+ years of experience in data engineering, with deep proficiency in Azure Data Factory, Azure Databricks, or Azure Synapse.
OLAP Mastery: Proven experience managing and tuning ClickHouse (or similar columnar databases like Druid/Pinot) for massive datasets.
Programming: Expert-level Python and SQL skills.
ML Engineering: Familiarity with ML frameworks (PyTorch, TensorFlow) and MLOps tools (MLflow, Kubeflow, or Azure Machine Learning).
Healthcare Domain: Prior experience with healthcare data formats (HL7, FHIR, 835/837) and a strong understanding of HITRUST/HIPAA security requirements.
Scale-up Mindset: Ability to build 'v1' processes while designing for 10x growth.
Preferred Qualifications:
Experience with Infrastructure as Code (Terraform, Bicep).
Knowledge of stream processing (Kafka, Azure Event Hubs).
Background in financial or payment integrity analytics.
Salary: 105,000 - 115,000
Similar Jobs
Principal Machine Learning & Data Engineer
Twilio · USA
Staff Fullstack Software Engineer, Core Performance
Dropbox · USA
Senior Forward Deployed Software Engineer
H1 · USA
Software Engineer L5, Python Platform
Netflix · USA
Sr. Machine Learning Engineer I
HubSpot · USA