Data Engineer (Pyspark)

Virtusa

Data Engineer (Pyspark) - (CREQ225424)

Description

o Minimum 4+ years of development and design experience in experience as Data Engineer

o Hands-on experience in building PySpark

o Should be able to perform data cleansing and transformations in PySpark.

o Unit testing using Python/PySpark code

o Experience on Big Data platforms and distributed computing (e.g. Hadoop, Map/Reduce, Spark, HBase, Hive)

o Experience in data pipeline software engineering and best practice in python (linting, unit tests, integration tests, git flow/pull request process, object-oriented development, data validation, algorithms and data structures, technical troubleshooting and debugging, bash scripting )

o Experience in Data Quality Assessment (profiling, anomaly detection) and data documentation(schema,dictionaries)

Experience in data architecture, data warehousing and modelling techniques (Relational, ETL, OLTP) and consider performance alternatives

Primary Location

: IN-TN-Chennai

Schedule

: Full Time

Employee Status

: Individual Contributor

Job Type

: Experienced

Travel

: No

Job Posting

: 10/07/2025, 12:31:13 PM

Read Full Description
Confirmed 10 hours ago. Posted 3 days ago.

Discover Similar Jobs

Suggested Articles