Design, develop, test, and maintain a robust Python package to structure data processing and upload tasks.
Implement tools within the package to handle diverse data streams, ensuring data quality for simulation inputs.
Analyze existing data collection and processing workflows, identify bottlenecks, and proactively propose and implement improvements.
Contribute your own ideas to enhance the overall data flow, tooling, and efficiency of our processes.
Collaborate closely with Market Analytics team members and potentially other data-providing teams to understand requirements.
Utilize our tech stack (Python, SQL, Git/GitHub, Databricks, Azure, Snowflake) to build and deploy solutions.
Document the Python package and associated data processes.
Your profile
Currently enrolled in a Bachelor's or Master's program in Computer Science, Data Science, Engineering, Information Technology, Mathematics, or a related field.
Strong Python programming skills and a good understanding of software development best practices (e.g., package structure, testing).
Solid understanding of SQL for data querying and manipulation.
Experience with version control using Git and platforms like GitHub.
Experience with or a strong interest in cloud platforms (specifically Azure) and data processing/warehousing tools (Databricks, Snowflake) is advantageous.
Excellent analytical and problem-solving skills with an ability to understand and improve complex processes.
Self-motivated, eager to learn, and comfortable working independently and contributing your own ideas.
Good communication skills in English (both written and verbal).