Machine Learning and Data Engineering Specialist

Mr. Cooper Group

At Mr. Cooper Group, You Make the Dream Possible.

Our purpose is simple: Keeping the dream of homeownership alive. As a Mr. Cooper Group team member, you play a big role in making that dream possible. Around here, we know our roles and work together, volunteer to make a difference, and challenge the status quo when needed. Everything we do is in the care and service of our teammates and our customers.

Join us and make the dream of homeownership possible!

Tools & Technology:

  • Python / R (Mandatory)
  • Databricks (Mandatory)
  • Data Lake
  • SQL

Skills:

  • Proficiency in machine learning algorithms and techniques.
  • Proficiency in big data technologies such as Spark or Kafka for processing and managing large volumes of data.
  • Experience with distributed computing and parallel processing frameworks.
  • Knowledge of data warehousing concepts and technologies such as Data Lakes, Google BigQuery, Snowflake, or similar platforms.
  • Strong understanding of statistical analysis and mathematical concepts.
  • Knowledge of deep learning architectures and neural networks.
  • Ability to preprocess data, including feature engineering and dimensionality reduction.
  • Experience in model evaluation, validation, and hyperparameter tuning.
  • Familiarity with cloud computing platforms such as Azure for deploying and scaling machine learning models.
  • Proficiency in programming languages such as Python and R for data analysis and model development.
  • Strong problem-solving skills and the ability to work with large, complex datasets.
  • Excellent communication skills to collaborate with cross-functional teams and stakeholders.
  • Understanding of software engineering principles for building scalable and maintainable machine learning systems.
  • Experience with machine learning frameworks such as TensorFlow, PyTorch, or Scikit-learn.

Key Responsibilities:

1. Data Architecture and Design:

  • Collaborate with cross-functional teams to understand data requirements and translate them into an effective data lake and Databricks architecture.
  • Design and implement scalable and efficient data pipelines, models, and storage solutions within the Data Lake ecosystem.

2. Data Ingestion and Integration:

  • Develop robust processes for ingesting real-time and batch data from various sources into the Data Lake.
  • Ensure data quality and consistency through data validation and transformation procedures.

3. Data Transformation and Processing:

  • Utilize Databricks for data transformation, cleaning, and enrichment tasks.
  • Optimize data processing workflows for performance and cost-efficiency.

4. Model Development and Evaluation:

  • Develop and implement machine learning models to address specific business problems using appropriate algorithms and techniques.
  • Conduct thorough evaluation and validation of models using metrics such as accuracy, precision, recall, F1-score, and ROC-AUC.
  • Perform hyperparameter tuning and optimization to improve model performance and generalization.
  • Explore various modeling approaches, including traditional statistical models and advanced deep learning architectures.
  • Collaborate with domain experts and stakeholders to refine model objectives and requirements.
  • Document model development processes, methodologies, and findings for transparency and reproducibility.

5. Data Governance and Security:

  • Implement data governance policies and access controls to safeguard sensitive data.
  • Monitor data lake security and ensure compliance with data privacy regulations.

6. Performance Optimization:

  • Continuously monitor and tune the performance of Data Lake and Databricks workloads.
  • Identify and resolve bottlenecks to enhance overall data processing speed.

7. Documentation and Collaboration:

  • Maintain comprehensive documentation of data pipelines, models, and architecture.
  • Collaborate with data scientists, analysts, and stakeholders to understand their data needs and provide necessary support.

Mr. Cooper Group is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or status as a protected veteran. EOE/M/F/D/V

Job Requisition ID:

022078

Job Category:

Information Technology

Primary Location City:

Chennai

Primary Location Region:

Tamil Nadu

Primary Location Postal Code:

600089

Primary Location Country:

India
