Data Engineer

Wiley

Location: Athens, Greece, OR Remote, Greece

Company: Atypon – A Wiley Brand

Atypon (www.atypon.com), a John Wiley & Sons company, is the world’s largest technology company in the scholarly and professional publishing industry. Headquartered in Silicon Valley, California, with a presence in five countries, Atypon is renowned for its technological leadership in the management and online delivery of professional and scholarly publisher content. Atypon's online publishing platform hosts more than 50% of the world's English-language Science, Technology, Engineering and Mathematics (STEM) publications.

Our mission is to unlock human potential. We welcome you for who you are, the background you bring, and we embrace individuals who get excited about learning. Bring your experiences, your perspectives, and your passion; it is in our differences that we empower the way the world learns.

About Wiley:

Enabling Discovery, Powering Education, Shaping Workforces.

We clear the way for seekers of knowledge: illuminating the path forward for research and education, tearing down barriers to society’s advancement, and giving seekers the help they need to turn their steps into strides.

Wiley may have been founded over two centuries ago, but our secret to success remains the same: our people. We are willing to challenge the status quo, move the needle, and be innovative. Wiley’s headquarters are located in Hoboken, New Jersey, with operations across the globe in more than forty countries.

About the Role:

Working on top of worldwide research content, actions & knowledge, our Information Discovery, Machine Learning & Data Engineering teams explore new ideas and develop novel, AI-powered solutions that help publishers & researchers know more, do more & achieve more. At the same time, our goal is to highlight low-quality research and ensure research (data) integrity by identifying signals of potential research misconduct.

We are looking for a talented and self-directed Data Engineer to work on the following areas:

  • Information Retrieval, Semantic Search, and Recommendation services (built on Elasticsearch, also leveraging LLMs, embeddings, and generative AI)
  • Large-scale distributed data processing (based on Apache Spark / Scala)
  • Streaming data processing (based on Google Dataflow / Apache Beam)

Senior-level ML engineers are preferred, but all levels may apply.

  • Requirement: Military obligations fulfilled for male candidates

How you will make an impact:

  • Work cross-functionally, collaborating with AI/ML engineers and product management to develop AI-powered, highly efficient data & search solutions in all the above-mentioned areas.
  • Design, build, orchestrate, and scale streaming and batch data processing pipelines using the most suitable patterns, technologies, and frameworks.
  • Be comfortable navigating the following technologies and programming languages: Java, Python, Scala, Elasticsearch, Apache Spark, Spring Boot, Apache Beam, Airflow, etc.

What we look for:

Minimum Qualifications

  • MSc with a strong GPA in Computer Science / Engineering
  • 2+ years of work experience in data, backend and/or search engineering
  • Solid background and experience in computer science and software development (especially backend development, databases, etc.)
  • Strong engineering skills and experience in Java
  • Background and experience with at least one of the following technologies:
      • Lucene-based search engines (e.g., Elasticsearch)
      • Distributed data processing using Apache Spark (preferably in Scala)
      • Streaming and batch processing pipelines (Apache Beam / GCP Dataflow)
  • Team player who enjoys working collaboratively
  • Fluency in written and spoken English
  • Strong sense of ownership, urgency and drive

Preferred Qualifications

  • 4+ years of work experience in data, backend and/or search engineering
  • Extensive practical experience in aggregating, exploring, transforming, manipulating, processing, and indexing large-scale data (cleansing, normalization, linkage, ETL)
  • Experience with Java and/or Python backend APIs and development frameworks, including those supporting long-running tasks (e.g., Spring Boot, FastAPI, Celery), microservices architecture, IoC/DI patterns, and containerization
  • Familiarity with:
      • NoSQL backends
      • Vector-based search / dense retrieval
      • Distributed message brokers (Kafka, Google Pub/Sub)
  • Background in applied ML and experience in some of the following areas: Information retrieval and recommendation, text mining, NLP, representation learning, LLMs
  • Proficiency in Scala and/or Python
  • Familiarity with Cloud technologies (especially Google Cloud)
  • Familiarity with the end-to-end software development lifecycle (CI/CD)
  • Experience with Git, Jira, Cloud Build or Jenkins
  • Good understanding of Agile methodologies
  • DevOps mindset

Benefits:

  • Unique opportunity to work for a Great Place to Work Certified Silicon Valley technology company in Greece
  • Work in a team of talented engineers and tech leaders
  • Competitive compensation package
  • Private health insurance
  • Opportunity to grow in a fast-moving, dynamic organization
  • Pension plan with employer contribution
  • Flexible working hours
  • Work-from-home options (hybrid environment)
  • Merit-based performance evaluation and objective setting
  • On-the-job training and career growth opportunities
  • Early Fridays during summer
  • Extended paid parental leave
  • Employee assistance program available

#LI-JG1

#LI-REMOTE

Confirmed 9 hours ago. Posted 30+ days ago.