Website ETCIO - Cloud Data Center
Proximity Works – Data Engineer ( 3-5 yrs ) — Data Engineer – Navi Mumbai Build and scale data pipelines that feed AI models, analytics, and business decisions for Bharat AI’s agentic platform. You’ll turn raw event streams into reliable, insight‑ready datasets that power product growth, revenue insights, and AI evaluation. What You’ll Do Design and maintain scalable batch and streaming pipelines for user events and system logs. Create canonical, analytics‑ready datasets tracking growth, engagement, cohorts, and conversion funnels. Work with product, data science, finance and research teams to translate business questions into trustworthy data models. Implement fault‑tolerant ingestion, transformation, and processing systems on Spark and cloud platforms. Participate in data‑architecture decisions, balancing scalability, cost, latency and analytical flexibility. Ensure data security, integrity and compliance with company and industry standards. Monitor pipeline health, troubleshoot failures, and continuously improve performance and data quality. What You Need 3–5 years experience as a Data Engineer or similar role. Strong Python proficiency for data processing and orchestration. Hands‑on expertise with Apache Spark (writing, debugging, optimizing jobs). Experience building and operating distributed data pipelines for analytics or ML. Ability to collaborate with cross‑functional teams and address diverse data requirements. Solid understanding of data‑pipeline design, batching, streaming, and storage systems. Good to Have Production experience with Databricks. Familiarity with GCP stack: Pub/Sub, Dataflow, BigQuery, and Cloud Storage. Exposure to data‑quality, validation, or schema‑management frameworks. The Opportunity You’ll join a fast‑growing AI engineering team where every data system you build directly
To apply for this job please visit in.linkedin.com.
