Principal Solutions Architect - Data Engineering

Full-time
India
Senior Level
Posted 11 hours ago
Apply for this position → Go ad-free with Premium ×

Join phData, a remote-first data and AI consultancy company with employees across the United States, Latin America, and India. We partner with industry leaders, including Snowflake, AWS, Anthropic, Azure, GCP, Fivetran, Pinecone, Glean, and dbt, to solve the complex data and AI challenges that slow large enterprises.

We're growing fast, and we give our people real ownership over their work. We hire top performers and trust them to deliver results.

Why phData?

Principal Solutions Architect - Data Engineering

Join phData, a dynamic and innovative leader in the modern data stack. We partner with major cloud data platforms, including Snowflake, AWS, Azure, GCP, Fivetran, Pinecone, Glean, and dbt, to deliver cutting-edge services and solutions. We're committed to helping global enterprises overcome their toughest data challenges. 

phData is a remote-first global company with employees based in the United States, Latin America, and India. We celebrate the cultures of our team members and foster a community of technological curiosity, ownership, and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.

Recognized as an award-winning workplace in the US, India, and LATAM

Required Experience:

  • 15-20 years of Experience with 8+ years as a hands-on Solutions Architect designing and implementing data solutions
  • Consulting leadership experience working with external customers, with the ability to multitask, prioritize tasks, frequently change focus, and work across a variety of projects. 
  • Strong programming expertise in Python and/or Scala; Java experience is a plus 
  • Core cloud data platforms, including Snowflake, AWS, Azure, Databricks, or GCP
  • SQL and the ability to write, debug, and optimize SQL queries
  • Modern data transformation frameworks: dbt (data build tool) for ELT pipeline development, testing, and documentation within cloud data warehouses
  • Demonstrated expertise in effectively leading and managing a team comprising Solution Architects and Data Engineers, fostering internal growth through coaching, mentoring, and performance management.
  • Proven track record of collaborating with client stakeholders, technology partners, and cross-functional sales and delivery team members across distributed global teams, ensuring seamless, successful project delivery outcomes.
  • Create strong cross-practice relationships to drive customer success.
  • Exhibits a strong sense of ownership in resolving challenges, committed to ensuring exceptional outcomes for all aspects of project execution.
  • Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration.
  • Client-facing written and verbal communication skills and experience
  • Create and deliver detailed presentations 
  • Detailed solution documentation (e.g., including POCS and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)
  • 4-year Bachelor's degree in Computer Science or a related field, or a Master's in Computer Applications or equivalent.

Prefer any of the following: 

  • Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Databricks, legacy big data platforms such as Hadoop/HDFS
  • Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra, or other NoSQL storage systems
  • Data integration technologies: Spark, Kafka, Apache Flink, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc, or other data integration technologies
  • Multiple data sources: Experience designing multi-source integration patterns across structured, semi-structured, and unstructured data — including APIs, CDC/event streams, relational databases, NoSQL systems, and file-based sources
  • Complete software development life cycle experience: including design, documentation, implementation, testing, and deployment
  • Automated data transformation and data curation: Spark, Spark streaming, automated pipelines
  • Data observability and quality frameworks: Experience with tools such as Great Expectations, Monte Carlo, or Acceldata; ability to embed data quality checks, lineage tracking, schema monitoring, and freshness alerting into pipeline design
  • AI/ML Pipeline Readiness: Experience designing data pipelines and feature stores that support ML/AI workloads; familiarity with MLflow, AWS SageMaker, or Azure ML is a plus
  • Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi
  • Open table formats: Apache Iceberg, Delta Lake (Databricks), Apache Hudi
  • Methodologies: Agile Project Management, Data Modeling (e.g., Kimball, Data Vault)

Why phData? We Offer:

  • Remote-First Workplace
  • Medical Insurance for Self & Family
  • Medical Insurance for Parents
  • Term Life & Personal Accident
  • Wellness Allowance
  • Broadband Reimbursement
  • Continuous learning and growth opportunities to enhance your skills and expertise
  • Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content

phData celebrates diversity and is committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at phData. We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at People Operations.

Go ad-free with Premium ×
Apply for this position →
Check if your resume is a good fit
25/100
Get Full Report
+ 1,284 new jobs added today
30,000+
Remote Jobs

Don't miss out — new listings every hour

Join Premium

Principal Solutions Architect - Data Engineering

Join phData, a remote-first data and AI consultancy company with employees across the United States, Latin America, and India. We partner with industry leaders, including Snowflake, AWS, Anthropic, Azure, GCP, Fivetran, Pinecone, Glean, and dbt, to solve the complex data and AI challenges that slow large enterprises.

We're growing fast, and we give our people real ownership over their work. We hire top performers and trust them to deliver results.

Why phData?

Principal Solutions Architect - Data Engineering

Join phData, a dynamic and innovative leader in the modern data stack. We partner with major cloud data platforms, including Snowflake, AWS, Azure, GCP, Fivetran, Pinecone, Glean, and dbt, to deliver cutting-edge services and solutions. We're committed to helping global enterprises overcome their toughest data challenges. 

phData is a remote-first global company with employees based in the United States, Latin America, and India. We celebrate the cultures of our team members and foster a community of technological curiosity, ownership, and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.

Recognized as an award-winning workplace in the US, India, and LATAM

Required Experience:

  • 15-20 years of Experience with 8+ years as a hands-on Solutions Architect designing and implementing data solutions
  • Consulting leadership experience working with external customers, with the ability to multitask, prioritize tasks, frequently change focus, and work across a variety of projects. 
  • Strong programming expertise in Python and/or Scala; Java experience is a plus 
  • Core cloud data platforms, including Snowflake, AWS, Azure, Databricks, or GCP
  • SQL and the ability to write, debug, and optimize SQL queries
  • Modern data transformation frameworks: dbt (data build tool) for ELT pipeline development, testing, and documentation within cloud data warehouses
  • Demonstrated expertise in effectively leading and managing a team comprising Solution Architects and Data Engineers, fostering internal growth through coaching, mentoring, and performance management.
  • Proven track record of collaborating with client stakeholders, technology partners, and cross-functional sales and delivery team members across distributed global teams, ensuring seamless, successful project delivery outcomes.
  • Create strong cross-practice relationships to drive customer success.
  • Exhibits a strong sense of ownership in resolving challenges, committed to ensuring exceptional outcomes for all aspects of project execution.
  • Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration.
  • Client-facing written and verbal communication skills and experience
  • Create and deliver detailed presentations 
  • Detailed solution documentation (e.g., including POCS and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)
  • 4-year Bachelor's degree in Computer Science or a related field, or a Master's in Computer Applications or equivalent.

Prefer any of the following: 

  • Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Databricks, legacy big data platforms such as Hadoop/HDFS
  • Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra, or other NoSQL storage systems
  • Data integration technologies: Spark, Kafka, Apache Flink, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc, or other data integration technologies
  • Multiple data sources: Experience designing multi-source integration patterns across structured, semi-structured, and unstructured data — including APIs, CDC/event streams, relational databases, NoSQL systems, and file-based sources
  • Complete software development life cycle experience: including design, documentation, implementation, testing, and deployment
  • Automated data transformation and data curation: Spark, Spark streaming, automated pipelines
  • Data observability and quality frameworks: Experience with tools such as Great Expectations, Monte Carlo, or Acceldata; ability to embed data quality checks, lineage tracking, schema monitoring, and freshness alerting into pipeline design
  • AI/ML Pipeline Readiness: Experience designing data pipelines and feature stores that support ML/AI workloads; familiarity with MLflow, AWS SageMaker, or Azure ML is a plus
  • Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi
  • Open table formats: Apache Iceberg, Delta Lake (Databricks), Apache Hudi
  • Methodologies: Agile Project Management, Data Modeling (e.g., Kimball, Data Vault)

Why phData? We Offer:

  • Remote-First Workplace
  • Medical Insurance for Self & Family
  • Medical Insurance for Parents
  • Term Life & Personal Accident
  • Wellness Allowance
  • Broadband Reimbursement
  • Continuous learning and growth opportunities to enhance your skills and expertise
  • Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content

phData celebrates diversity and is committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at phData. We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at People Operations.