Job Summary
In this role, your core responsibility will be managing data, designing robust ETL pipelines, and building our next-generation Data Warehouse and Data Lake. Beyond core Data Engineering, you will act as a bridge to our AI/ML initiatives by implementing specialized data pipelines for AI/ML, conducting data analysis, and integrating AI models. If you have a strong foundational background in data engineering and a passion for AI/ML technologies, this role is for you.
Job Responsibilities*
Core Data Engineering & Infrastructure:
- Big Data Management: Design, develop, and maintain scalable data infrastructure to manage and process large-scale datasets.
- ETL Pipeline Implementation: Architect and optimize streaming and batch ETL pipelines to ensure efficient data ingestion and transformation.
- Data Warehouse & Lakehouse: Collaborate on the design, creation, and optimization of the company’s Data Warehouse and Data Lake/Lakehouse architectures.
- Data Performance Optimization: Implement data compression, partitioning, and indexing techniques to optimize storage costs and query speeds.
Data Analysis & Reporting:
- Reporting & Dashboards: Work closely with business stakeholders to analyze data requirements and build semantic layers/data marts.
- Metabase Integration: Design and optimize SQL queries behind Metabase to deliver fast, interactive, and actionable dashboards for cross-functional teams.
AI Engineering:
- Data Pipelines for Data Science: Implement specialized, high-performance data pipelines specifically designed to feed Machine Learning models and AI workflows.
- AI Model Deployment & Integration: Assist in evaluating, deploying, and optimizing AI/ML models (including Generative AI/LLM APIs and predictive models) into production environments. AI Engineering: Partner with senior engineers on technical requirements, model testing, and deployment best practices, building toward end-to-end ownership of AI/ML solutions as the data foundation matures.
Qualifications
- Bachelor's degree in Computer Engineering, Computer Science, or a related technical field.
-
3+ years of experience in Software Engineering, Data Engineering, Data Pipeline implementation, or a related data-centric role.
-
Strong programming skills in Python and proficient in SQL (including Window functions and Query Optimization).
-
Experience with Big Data tools and distributed computing frameworks.
-
Hands-on experience or strong theoretical knowledge of ETL processes.
-
Solid understanding of basic Machine Learning concepts, algorithms, and data preprocessing for ML.
-
Experience or strong interest in Generative AI models (LLM APIs, Prompt engineering, NLP) and model optimization.
-
Excellent communication skills to explain technical data concepts and insights to diverse audiences.
-
Ability to work effectively in a fast-paced, agile environment.
-
Good command on both English and Thai.