Data Engineer (Remote, US)
Quick Summary
About Sayari: Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. The Sayari Commercial World Model resolves 11.
Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. The Sayari Commercial World Model resolves 11.7B+ primary-source records from 250+ jurisdictions forming the ground truth of global commerce. A Judgment Ontology, encoding over a decade of investigative tradecraft, and Superconductor, an agentic orchestration platform, deliver AI that reasons like an expert analyst, shows its work, and traces every finding to its source. Trusted by U.S. Customs and Border Protection, HM Revenue & Customs, and Fortune 500 enterprises, Sayari is used by thousands of professionals across 35+ countries to secure supply chains and dismantle illicit networks. Headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.
As a Data Engineer at Sayari, you will be the engine behind the world’s most comprehensive commercial world model. You will join a high-autonomy team responsible for building and scaling the complex orchestration systems that transform billions of primary-source records into actionable intelligence. This is a role for a "builder" who respects the complexity of large-scale ETL and graph databases and is "PhD-curious" about the future of AI-native data products and modern orchestration.
Responsibilities
~1 min read- →Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow to support our core data acquisition and entity resolution engines.
- →Collaborate cross-functionally with AI/ML and Product teams to implement new features and AI-native products.
- →Proactively identify and resolve bottlenecks in our complex ETL processes, bringing a fresh perspective to refine and optimize our existing codebase.
- →Contribute to a robust engineering culture through rigorous code reviews, unit testing, and clear communication of design decisions.
- →Own the end-to-end delivery of roadmap tasks within two-week sprints, ensuring work meets high standards for quality, documentation, and performance.
- →Participate in roadmap planning and story refinement, eventually taking ownership of major epics that drive our long-term product defensibility.
- Professional proficiency in Python and experience contributing to shared codebases using Git (branching, PRs, code reviews).
- Demonstrated experience working with relational databases (PostgreSQL/BigQuery) and an interest in or familiarity with graph databases.
- Familiarity with distributed computing (Spark) or a strong desire to master it.
- Strong collaborative skills and the ability to work effectively in an Agile, sprint-based environment.
- A "self-directed" orientation: ability to move tasks from "assigned" to "complete" with high autonomy and clear communication.
Nice to Have
~1 min read- Experience with Django, Scala, or Scrapy.
- Hands-on experience with workflow orchestration tools like Airflow.
- Experience or strong interest in LLM tuning, deployment, and AI engineering best practices.
- Experience working with international or non-English datasets.
- Prior experience working with high-scale, complex data pipelines.
The target base salary for this position is $90,000-$120,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.
What We Offer
~1 min readLocation & Eligibility
Listing Details
- Posted
- May 26, 2026
- First seen
- May 26, 2026
- Last seen
- May 29, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 87%
- Scored at
- May 26, 2026
Signal breakdown
Please let Sayari know you found this job on Jobera.
3 other jobs at Sayari
View all →Explore open roles at Sayari.
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.
