Data Engineer

IT Services

Upstox

We are looking for a savvy Data Engineer to join our growing team of analytics experts at Upstox. The hire will be responsible for expanding and optimising our data and data pipeline architecture, as well as optimising data flow and collection for cross-functional teams. The ideal candidate is an experienced data pipeline builder and data wrangler who enjoys optimising data systems and building them from the ground up.
You will be responsible for –
– Creating complex data processing pipelines as part of diverse, high-energy teams.
– Designing scalable implementations of the models developed by our Data Scientists.
– Hands-on programming based on TDD, usually in a pair programming environment.
– Deploying data pipelines in production based on Continuous Delivery practices.
– Creating and maintaining clear documentation on data models/schemas as well as transformation/validation rules.
– Troubleshooting and remediating data quality issues raised by pipeline alerts or downstream consumers.
– Engaging with stakeholders to gather requirements and deliver data solutions.
– Advising clients on the usage of different distributed storage and computing technologies from the plethora of options available in the ecosystem.
Ideally, you should have –
– Good understanding of building and deploying large-scale data processing pipelines in a production environment.
– Experience building data pipelines and data-centric applications using distributed storage platforms like HDFS, S3, and NoSQL databases (HBase, Cassandra, etc.) and distributed processing platforms like Hadoop, Spark, Hive, Oozie, Airflow, etc. in a production setting.
– Hands-on experience in MapR, Cloudera, Hortonworks and/or Cloud (AWS EMR, Azure HDInsight, Qubole, etc.) based Hadoop distributions.
– Strong communication and client-facing skills, with the ability to work in a consulting environment, are essential.
Desired Skills and Experience
– Comfortable working in a Linux environment.
– Fluent in programming languages like Node.js, Java, or Python, and familiar with AWS.
– SQL (expert level).
– Hands-on experience in distributed processing platforms such as AWS EMR, MapR, Cloudera.
– Distributed storage platforms like HDFS, S3, and NoSQL databases.

To apply for this job please visit jobs.lever.co.
