Key Responsibilities:
Develop and maintain data pipelines using PySpark in distributed computing environments (e.g., AWS EMR, Databricks).
Integrate and synchronize data between enterprise systems and the Reltio MDM platform.
Design and implement data transformation, cleansing, and enrichment processes.
Collaborate with data architects, business analysts, and Reltio solution architects to ensure high-quality data modeling.
Work on API-based integration between Reltio and upstream/downstream applications.
Optimize PySpark jobs for performance and cost-efficiency.
Ensure data quality, integrity, and governance throughout the pipeline.
Troubleshoot and resolve data and performance issues in existing workflows.
Required Skills & Qualifications:
5 to 7 years of experience in PySpark development and distributed data processing.
Strong understanding of Apache Spark, DataFrames, and Spark SQL.
Experience with Reltio MDM, including entity modeling, survivorship rules, match & merge configuration.
Proficiency in working with REST APIs and JSON data formats.
Experience with cloud platforms like AWS and data services (e.g., S3, Lambda, step function)
Good knowledge of data warehousing concepts, ETL workflows, and data modeling.
Familiarity with CI/CD practices and version control tools like Git.
Strong problem-solving and communication skills.
About Virtusa
Teamwork, quality of life, professional and personal development: values that Virtusa is proud to embody. When you join us, you join a team of 27,000 people globally that cares about your growth — one that seeks to provide you with exciting projects, opportunities and work with state of the art technologies throughout your career with us.
Great minds, great potential: it all comes together at Virtusa. We value collaboration and the team environment of our company, and seek to provide great minds with a dynamic place to nurture new ideas and foster excellence.
Virtusa was founded on principles of equal opportunity for all, and so does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.