Data Scientist
This role is Office Based, Hyderabad OfficeWe are seeking a Software Engineer to join our innovative data science & generative AI team. The ideal candidate will possess a strong technical background in data science, machine learning, and generative AI, combined with fluency in cloud infrastructure and modern data engineering best practices. In this role, you’ll drive the architecture, design, and implementation of state-of-the-art data and AI solutions that power our products and services.
In this role you will...
- Collaborate with team to design, develop, and deploy robust data science and AI solutions with a focus on generative AI (GenAI) models and frameworks (e.g., LLMs, diffusion models).
- Optimize and cloud-based infrastructure (AWS preferred – Good to have) for scalable, reliable, and secure data processing and model deployment.
- Develop and maintain data pipelines, ETL processes, and model training workflows.
- Build and implement CI/CD pipelines for machine learning models and data applications (e.g., using Jenkins, GitHub Actions, etc.).
- Optimize data storage solutions (e.g., data lakes, SQL/NoSQL databases) for performance and scalability.
- Collaborate with cross-functional teams, including product, engineering, and business stakeholders, to deliver impactful AI features and insights.
- Monitor model and system performance, troubleshoot issues, and continuously improve deployment workflows.
- Stay abreast of advancements in GenAI, MLOps, data science, and cloud technologies, bringing fresh ideas to the team.
- Contribute to technical documentation and participate in code and design reviews.
You have got what it takes if you have…
-
2+ years of hands-on experience in data science, machine learning engineering, or GenAI engineering roles.
Proficiency with at least one programming language used in data science (Python preferred; experience with Java is a plus). - Experience in developing and deploying machine learning, deep learning, or Generative AI models (e.g., LLMs, Transformers, diffusion models).
- Exposure to AWS services (e.g., S3, EC2, Lambda, SageMaker, RDS) or similar cloud platforms.
- Hands-on experience building CI/CD pipelines (e.g., with Jenkins) for machine learning and data applications.
- Solid background in SQL and/or NoSQL databases, with experience in data modeling and performance tuning.
- Strong problem-solving skills and the ability to think critically and creatively about data and AI-driven challenges.
- Excellent collaboration and communication skills, with a proven ability to work on cross-functional teams.
An extra dose of awesome if you have…
- Familiarity with front-end frameworks for AI-powered applications (e.g., React, Streamlit, Gradio).
- Experience with containerization (e.g., Docker) and orchestration (e.g., Kubernetes) for machine learning deployments.
- Experience working in Agile environments.
- Prior exposure to MLOps tools, MLflow, Airflow, or similar workflow orchestrators.
#LI-Onsite
Our Culture:
Spark Greatness. Shatter Boundaries. Share Success. Are you ready? Because here, right now – is where the future of work is happening. Where curious disruptors and change innovators like you are helping communities and customers enable everyone – anywhere – to learn, grow and advance. To be better tomorrow than they are today.
Who We Are:
Cornerstone powers the potential of organizations and their people to thrive in a changing world. Cornerstone Galaxy, the complete AI-powered workforce agility platform, meets organizations where they are. With Galaxy, organizations can identify skills gaps and development opportunities, retain and engage top talent, and provide multimodal learning experiences to meet the diverse needs of the modern workforce. More than 7,000 organizations and 100 million+ users in 180+ countries and in nearly 50 languages use Cornerstone Galaxy to build high-performing, future-ready organizations and people today.
Check us out on LinkedIn, Comparably, Glassdoor, and Facebook!