Data Engineer/Cloud Data Engineer

About Yugen

Founded in 2020 by ex-Silicon Valley IIT alumni with prior experience in Tech Consulting and Product Management, Yugen is an early-stage startup in the Data Science and Machine Learning Engineering space. Simultaneous adoption of Algorithms, Engineering, and Responsible AI will shape the future, and Yugen’s vision is to be at the forefront of that trend. We are working towards becoming India’s leading player in the Data Science and MLOps consulting space over the next 4-6 years.

About the Role and Responsibilities

We're looking for Data Engineers who will work hands-on and manage technology projects across multiple client engagements. As a Data Engineer, you will be part of the Engineering and Technology function and eventually play a central role in building our MLOps platform.

Your key responsibilities will be to:
● Translate business requirements into scalable technical solutions
● Design, build and manage data ingestion, transformation and exploration pipelines designed for high throughput and low latency
● Deploy a range of data engineering pipelines into production
● Collaborate with ML Engineers and Data Scientists on the system design, implementation and maintenance of Feature Stores and Model Monitoring
● Be results-oriented and emphasise on-time delivery of projects
● Communicate potential roadblocks and trade-offs between performance and complexity upfront
● Mentor other Data Engineers and enable their growth

Yugen is currently an early-stage startup with a fast-growing client portfolio. We create scalable and reliable ML Systems, so Engineering is a crucial part of our work. We are strongly committed to the ‘Every promising model should face the test of reality’ philosophy.
Key Qualifications

Must Haves
● Knowledge and experience of deploying and maintaining large-scale advanced systems in production environments
● 2+ years of hands-on experience building distributed computing systems with Spark/PySpark, Scala and Kafka
● Strong programming skills in Python & SQL
● Experience in handling architectural and design considerations such as performance, scalability, reusability and flexibility
● 2+ years of hands-on experience with cloud computing platforms (any one of AWS/Azure/GCP)
● Experience with workflow management tools such as Airflow
● Familiarity with Linux environments, SQL & NoSQL databases and shell scripting
● Clear understanding of Git/version control

Desirable
● Hands-on experience in deploying ML models
● Knowledge of feature stores
● Knowledge of containerised deployments on Kubernetes