3Pillarglobal · 22h ago
Lead Data Engineer with AI experience
Data Engineer · Data
3Pillar is an AI transformation partner on a mission to help enterprises build the AI-native products and intelligent agents that will define the next era of business. With teams across North America, Europe, Latin America, and Asia, we work with the most ambitious companies in financial services, healthcare, media, and technology, helping them move faster, modernize boldly, and compete on their own terms. Our HelixAI platform and Helix Pods delivery model put our engineers at the center of real agentic transformation, doing work that is open, portable, and built to last. We are building the future of enterprise AI.
We are looking for a Lead Data Engineer to build, operate, and continuously improve the
data pipelines, retrieval infrastructure, and ML/LLMOps foundations that power our AI
initiatives. In this role, you will turn reference architectures and data contracts
into robust, production-grade implementations that serve conversational AI assistants,
dashboard copilots, autonomous agents, RAG applications, and predictive ML models.
Data Pipeline Engineering
- Build, test, and maintain production pipelines (batch and real-time) on Snowflake, PySpark, Delta Lake, and Kafka.
- Implement data quality checks, schema validation, and alerting at every pipeline stage.
- Migrate legacy ETL/DWH to cloud-native AWS/Azure architectures with measurable latency and cost improvements.
- Maintain CI/CD pipelines: automated testing, deployment, rollback, and IaC (Terraform, GitHub Actions).
RAG, Vector & Retrieval Infrastructure
- Build end-to-end retrieval infrastructure: document ingestion, embedding pipelines, vector store management (Pinecone, FAISS, ChromaDB, OpenSearch), and hybrid retrieval layers.
- Implement chunking, metadata filtering, and reranking, tuning for precision, recall, and latency.
- Maintain data freshness and index consistency; instrument with context relevance and faithfulness metrics.
Semantic Layer & Knowledge Infrastructure
- Implement and maintain business entity mappings, ontologies, and knowledge graphs (Neo4j) per the Architect's design.
- Build and version the feature store and semantic data contracts serving both ML models and LLM applications.
- Manage metadata, data lineage, and audit trail instrumentation across the platform.
ML/LLMOps Pipeline Support
- Build ML data infrastructure: training curation, feature engineering, MLflow experiment tracking, dataset versioning.
- Support LLM fine-tuning workflows: corpus curation, quality filtering, dataset formatting.
- Implement automated evaluation pipelines: factual accuracy, hallucination detection, regression tracking.
- Maintain production monitoring dashboards for pipeline health, model metrics, and alerting.
Agentic Data Infrastructure
- Build and maintain data APIs, tool schemas, and memory/state stores that autonomous agents depend on.
- Implement agent observability: capture inputs, retrieved context, tool calls, reasoning traces, and outputs.
- Maintain text-to-SQL layers, semantic query interfaces, and context APIs for conversational AI consumers.
Governance, Security & Data Quality
- Implement RBAC, attribute-based access, PII detection/masking, data classification, and audit logging.
- Enforce data contracts and schema governance with automated breaking-change detection and versioned migrations.
- Build data quality monitoring (completeness, freshness, consistency) with automated alerting and root-cause tooling.
- Support compliance readiness: audit trails, data provenance, and regulatory documentation.
aligned engineering.
Secondary Skills: LangChain, LlamaIndex, LLM APIs (OpenAI, Bedrock, Claude, HuggingFace), Pinecone, FAISS, ChromaDB, OpenSearch, MLflow, FastAPI, Neo4j, LangGraph, prompt engineering, RLHF dataset prep, LLM fine-tuning workflows
Regards,
Kiran Dhanak
Talent Acquisition Manager
Location & Eligibility
Where is the job
India
Remote within one country
Who can apply
IN
Listing Details
- Posted: June 12, 2026
- First seen: June 12, 2026
- Last seen: June 13, 2026
Posting Health
- Days active: 0
- Repost count: 0
- Trust Level: 68%
- Scored at: June 12, 2026