Lead Data Engineer
at Thoughtworks
Ho Chi Minh City
At the time of application, candidates must be Vietnamese citizens
Lead data engineers at Thoughtworks develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions. They might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping the client understand the potential that data brings to solve their most pressing problems. On projects, they lead the design of technical solutions, or perhaps oversee a program inception to build a new product. Alongside hands-on coding, they lead the team in implementing the solution.
Job responsibilities
- You will lead and manage data engineering projects from inception to completion, including goal-setting, scope definition and ensuring on-time delivery with cross-team collaboration.
- You will collaborate with stakeholders to understand their strategic objectives and identify opportunities to leverage data and data quality.
- You will design, develop and operate modern data architecture approaches to meet key business objectives and provide end-to-end data solutions.
- You will be responsible for creating, designing and developing intricate data processing pipelines that address clients' most challenging problems.
- You will collaborate with data scientists to design scalable implementations of their models.
- You will write clean, iterative code based on TDD and leverage various continuous delivery practices to deploy, support and operate data pipelines.
- You will lead and advise clients on how to use different distributed storage and computing technologies from the plethora of options available.
- You will develop data models by selecting from a variety of modeling techniques and implementing the chosen data model using the appropriate technology stack.
- You will be responsible for data governance, data security and data privacy to support business and compliance requirements.
- You will define the strategy for and incorporate data quality into your day-to-day work.
Job qualifications
Technical Skills
- Extensive experience with designing and delivering solutions using Databricks (SparkSQL, PySpark, Spark DataFrames) and open table format design & optimisations (Delta Lake, Apache Iceberg).
- Extensive experience designing modern data architecture patterns (Medallion, data mesh, data product approach).
- Extensive experience designing and developing dbt (core/cloud) and advanced SQL for analytical transformations, streaming and batch data ingestion frameworks and using orchestration/workflow tools to schedule and sequence pipelines.
- Experience with multi-cloud architectures and cross-cloud data integration.
- Attained a Cloud Data Engineer certification (GCP, AWS, or Azure) or a Databricks certification (Data Engineer Associate).
- Experience with building DevOps solutions (GitLab/GitHub CI/CD, Terraform) and containerisation.
- Experience in designing and delivering data quality, testing, and governance (Great Expectations, dbt tests, data contracts).
- Proficient in Python engineering, applying software engineering and design pattern best practices.
- Proficient in the use of LLMs and associated technologies such as vector databases and graph databases for semantic search.
- Exceptional consulting-level English communication skills for strategic stakeholder engagement (negotiating, influencing, presenting, and resolving complex issues at an executive level).
Professional Skills
- Demonstrated experience in leading large data teams, driving collaboration with business, analysts, and data scientists, and influencing technical direction.
- Proven ability in data product design and domain-driven design in data platforms.
- Solid experience with machine learning pipelines and MLOps (MLflow, Vertex AI, SageMaker, Azure ML).
- Hands-on experience with real-time analytics and low-latency serving layers (e.g., Apache Flink, Materialize, Rockset).
- Practical experience with vector databases (Pinecone, Weaviate, ChromaDB) or semantic search in AI workflows.
Other things to know
Learning & Development
There is no one-size-fits-all career path at Thoughtworks: however you want to develop your career is entirely up to you. But we also balance autonomy with the strength of our cultivation culture. This means your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. We see value in helping each other be our best and that extends to empowering our employees in their career journeys.
About Thoughtworks
Thoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we’re pushing boundaries through our purposeful and impactful work. For 30+ years, we’ve delivered extraordinary impact together with our clients by helping them solve complex business problems with technology as the differentiator. Bring your brilliant expertise and commitment to continuous learning to Thoughtworks. Together, let’s be extraordinary.
