Why Socure?
At Socure, we’re on a mission—to verify 100% of good identities in real time and eliminate identity fraud from the internet.
Using predictive analytics and advanced machine learning trained on billions of signals to power RiskOS™, Socure has created the most accurate identity verification and fraud prevention platform in the world. Trusted by thousands of leading organizations—from top banks and fintechs to government agencies—we solve real, high-impact problems at scale. Come join us!
About the Role
Data lies at the heart of everything we do at Socure, and establishing meaningful relationships between data elements is key to the performance of our products. We are seeking an experienced and highly motivated Principal Data Engineer to lead the design and development of our new Identity Graph project, a greenfield initiative that will form the foundation of all our products. You will lead a team of talented data and backend engineers while playing a key role in advancing Socure’s identity infrastructure to power the next generation of identity fraud products. Your work will directly impact the scalability, reliability, and performance of solutions critical to Socure’s success.This project has the highest visibility and backing including the CEO, CTO and CPO
What You'll Do
- Lead the architectural design and implementation of scalable, reliable, and secure data platform solutions.
- Provide mentorship and guidance to engineers through design reviews, code reviews, etc.
- Establish and enforce best practices in data engineering, including coding, testing, and deployment.
- Design and implement solutions using modern data lakes, warehouses, and graph databases to model complex data relationships.
- Implement robust testing frameworks and CI/CD pipelines to support continuous delivery of data products.
- Own the end-to-end development of complex data ETL pipelines for batch and real-time processing, supporting both model training and inference. Ensure data quality and consistency across all pipelines.
- Oversee the development of low latency, scalable, and performant APIs to query the graph and generate ML features.
- Partner with legal and security teams to ensure compliance with PII handling, security, and regulatory requirements.
- Effectively communicate complex technical concepts to both technical and non-technical stakeholders.
- Stay current with emerging technologies and trends in big data, graph databases, and cloud computing.
- Contribute to open-source projects and represent Socure in technical communities.
What You Bring
- 10-15+ years of professional experience in data engineering, including 4+ years in a lead role.
- Proven ability to lead and grow high-performing technical teams.
- Expertise in building large-scale data solutions in modern cloud platforms (AWS, GCP, or Azure).
- Proficiency in programming languages such as Python, SQL, and one or more of Java, Scala, or Go.
- Strong experience with data lakes, data warehouses, and streaming frameworks (e.g., Apache Spark, Kafka, Flink).
- Hands-on experience with graph databases and algorithms (e.g., Neo4j, Amazon Neptune, or OrientDB).
- Strong understanding of software architecture, design patterns, and data structures.
- Experience with containerization technologies (Docker, Kubernetes) and microservices architecture.
- Well versed in developing highly available, scalable low latency online systems.
- Familiarity with DevOps practices and CI/CD pipelines.
Bonus Points
- Experience in building ML feature data pipelines using Databricks or similar ecosystems.
- Knowledge of cybersecurity best practices and regulatory compliance.
- Deep understanding of graph algorithms and data processing techniques.
- Prior experience in developing and working with Graphql APIs
- Expertise in JVM based language, performance tuning and caching
- Experience with identity resolution, or fraud detection systems
Socure is an equal opportunity employer and values diversity of all kinds at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Follow Us!
YouTube | LinkedIn | X (Twitter) | Facebook
Read Full Description