Data Processing & Transformation
- Practical experience with ETL and ELT principles, including batch and real-time data pipelines
- Strong Python skills for data processing, transformation, and automation
- Good knowledge and hands-on experience with SQL and PL/SQL, including query optimization
- Basic to intermediate experience with Apache Spark for large-scale parallel data processing
- Practical experience with DBT for building, modeling, and testing transformation pipelines
Data Orchestration & Streaming- Experience with Apache Airflow for workflow scheduling, orchestration, and automation
- Knowledge and hands-on experience with Apache Kafka and real-time data streaming fundamentals
Data Structuring & Modelling
- Knowledge of conceptual, logical, and physical data modelling
- Experience with Data Warehouse (DWH) concepts and modelling methodologies such as Kimball/Inmon and star/snowflake schema
Infrastructure & Containerization
- Practical knowledge of Docker and containerization principles
- Basic familiarity with Kubernetes and container orchestration
Work Experience
- Minimum 3 years of professional experience in data engineering and ETL/ELT implementation
- Hands-on experience building and managing SQL-based data pipelines using Apache Spark, Apache Airflow, and DBT
- Active participation in requirement analysis, technical solution development, and implementation of best practices within teams