Analyze large and complex datasets from unstructured data, logs, and production records.
Build LM-based applications such as Retriever-Augmented Generation (RAG) to support different teams within the organization.
Create synthetic data for language models, ensuring diversity and representativeness in datasets.
Conduct and contribute to experiments, write reusable code.
Deploy ML/DL models into production.
Work closely with operation and production teams to understand their data needs and provide tailored analytical solutions.
Collaborate with cross-functional teams within the Data Science, Data Engineering, and Data Analytics divisions to ensure data pipelines, models, and analyses are aligned with business needs.
Contribute to innovation by staying updated on the latest industry trends in data science, machine learning, and AI, and applying them to upstream challenges.
Xüsusi tələblər
Bachelor’s or master’s degree in STEM, or equivalent practical experience (3+ years).
Proficiency in Python programming language.
Strong understanding of classical machine learning and deep learning models.
Experience with relational and non-relational databases (e.g., SQL, data pipelines, data lakes).
Experience with Large Language Models, NLP, or Generative AI.
Experience with Docker.
Experience with machine learning frameworks (e.g., TensorFlow, PyTorch).
Experience with Git.
Excellent communication skills for presenting complex solutions to non-technical stakeholders and production team.