Henry Joseph — DevOps Engineer
Lead Data Engineer with around 10 years of experience designing scalable data platforms, building high-performance data pipelines, and delivering cloud-based analytics solutions across financial and enterprise environments. Experienced in Databricks, Snowflake, and Azure, with a strong focus on data governance, reliability, and cost-efficient engineering. Expertise in developing end-to-end ETL/ELT pipelines using Python and SQL, with strong experience across Azure, AWS, and GCP. Skilled in building data solutions using Databricks, BigQuery, Airflow, Azure Data Factory, and dbt to support large-scale analytics, reporting, and AI/ML workflows. Experience building GenAI-ready data platforms supporting RAG pipelines, vector search, embeddings, and AI-driven analytics workflows and leveraged LLMs to enhance data engineering workflows, including schema mapping and automated documentation, and designed AI agent-based workflows to automate data validation and transformation logic. Experienced in modernizing legacy systems into cloud-native architectures, optimizing large-scale data processing, and implementing scalable data lake and warehouse solutions. Strong focus on data quality, performance optimization, and cost-efficient pipeline design. Building reliable healthcare data pipelines that turn complex data into actionable insights. Hands-on experience with distributed data processing using Spark and PySpark, along with exposure to streaming architectures (Kafka) and AI-driven pipelines, including MLflow and LLM-based data workflows and hands-on exposure to LangChain, OpenAI embeddings, AI agent orchestration, and intelligent automation for enterprise data engineering use cases. Adept at building reliable, production-grade systems using CI/CD, Terraform, and containerization. Technologies: Python • SQL • Databricks • BigQuery • Snowflake • Airflow • Azure Data Factory • dbt • PySpark • Oracle • Windsurf • AWS • Azure • GCP • Apache Flink • Java • Kafka • Palantir • MLflow • LLM Pipelines • ETL/ELT • Power BI • Data Modeling • CI/CD • Terraform • Docker • GenAI • Delta Lake • Workday • LangChain • RAG • Vector Databases • OpenAI APIs • Kubernetes • Microsoft Fabric • Azure Synapse • Terraform
Stackforce AI infers this person is a Data Engineering expert with a strong focus on AI and cloud solutions in the Fintech sector.
Experience: 6 yrs 9 mos
Skills
- Data Engineering
- Ai/ml
- Etl/elt
Career Highlights
- Expert in building AI-ready data platforms.
- Proficient in cloud-based analytics solutions.
- Strong focus on data governance and reliability.
Work Experience
State Street
Lead Data Engineer | AI/ML (2 yrs 5 mos)
Apple
Senior Data Engineer/ETL Developer (3 yrs 11 mos)
Dell Technologies
Data Engineer/SQL Developer (2 yrs 10 mos)
Education
Master of Science - MS at Webster University
Bachelor of Technology - BTech at Christ University, Bangalore