Principal Data Engineer
About Sayari
Sayari is a venture-backed and founder-led global corporate data provider and commercial intelligence platform that serves financial institutions, legal and advisory service providers, multinationals, journalists, and governments. Thousands of analysts and investigators in over 30 countries rely on our products to safely conduct cross-border trade, research front-page news stories, confidently enter new markets, and prevent financial crimes such as corruption and money laundering.
Our company culture is defined by a dedication to our mission of using open data to prevent illicit commercial and financial activity, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.
We are looking for a Principal Data Engineer to join our Data Resolution team and serve as a technical anchor for our most complex data challenges. In this role, you will be a "player-coach," spending the majority of your time (70%) hands-on with Spark and graph data logic while dedicating the remainder of your time to system architecture, design planning, and technical mentorship. You will be instrumental in evolving our graph build pipelines, optimizing our cloud footprint, and overseeing the long-term planning and execution of major data pipeline re-architectures. This is a high-impact role where your work directly powers the data products used by global systems defenders.
Responsibilities
- Design and implement complex Spark data logic, focusing on performance optimization, data volume tuning, and robust execution.
- Own the architectural design of graph build pipelines, ensuring they are scalable, automated, and highly resilient.
- Plan and oversee the strategic re-architecture of data pipelines to meet evolving business needs and scale.
- Optimize infrastructure-as-code and schema designs to reduce cloud costs and improve pipeline latency.
- Act as a technical consultant for the team, fostering a collaborative and engineer-led approach to design decisions.
- Support the development of the engineering team through code reviews, design docs, and architectural best practices.
- Ensure the accuracy of mission-critical data outputs.
Required Skills & Experience
- 8+ years of experience in the big data space, with a proven track record of implementing large-scale features and leading process redesigns.
- Expert-level mastery of Apache Spark for large-scale data processing.
- Strong experience with orchestration tools (Airflow) and cloud computing environments.
- Hands-on experience architecting and managing data flows into databases such as Elasticsearch, Memgraph, and Cassandra.
- Demonstrated ability in system architecture, including Infrastructure as Code (IaC) and schema design.
- A "builder" mindset with experience evolving and improving existing architectures to meet new scale requirements.
Preferred Skills & Experience
- Experience working specifically with graph data or graph databases.
- Prior experience with entity resolution or identity resolution systems.
- Experience evaluating and selecting modern analytical database architectures.
The target base salary for this position is $200,000-$220,000 plus company bonus and equity. Final offer amounts are determined by multiple factors, including location, local market variances, candidate experience and expertise, and internal peer equity, and may vary from the amounts listed above.
Listing Details
- Posted: February 25, 2026
- First seen: March 26, 2026
- Last seen: April 18, 2026
