About the role
We are seeking a Data Scientist with an interest in Natural Language Processing (NLP). In this role, you will assist in developing NLP models, apply problem solving skills, and support in pretraining and fine-tuning Large Language Models (LLMs). The ideal candidate will have a foundational understanding of data science, with a desire to specialize in NLP and machine learning, and a passion for learning and developing real-world solutions.
What you'll do
- Assist in the development and implementation of Natural Language Processing (NLP) models and LLMs
- Apply problem-solving skills to assist in the development of products and solutions, with a focus on learning and growth
- Support in pretraining and fine-tuning Large Language Models (LLMs), task solvers, and RAG.
- Collaborate with cross-functional teams to implement models, monitor outcomes, and iterate on solutions
- Conduct data analysis and exploratory research to uncover patterns, trends, and insights that can improve model performance and drive business outcomes.
- Assist in the evaluation and selection of appropriate algorithms, techniques, and methodologies for solving specific NLP problems
- Collaborate with senior data scientists and managers to document and communicate the results of model development and experiments effectively
- Stay up-to-date with the latest advancements in NLP, LLMs, RAG, and related fields, actively seeking opportunities to enhance knowledge and skills
Qualifications
Bachelor's degree in computer science, Data Science, Engineering, or a related fieldProficiency in Arabic language (Native Arabic speaker) is a must. Knowledge of Modern Standard Arabic grammar and morphology is a plus3+ years of experience as a Data Scientist specialized in NLP or similar roleSolid understanding of machine learning, natural language processing (NLP), large language models (LLMs), and deep learning techniquesDeep expertise in Python's NLP libraries such as NLTK, SpaCy, and Hugging Face TransformersExcellent problem-solving skills and ML system design with an emphasis on product developmentHands-on experience in deep learning frameworks such as TensorFlow or PyTorchExperience in R&D is desirableExperience in one or more of the following is a must :LLMs pre-training, fine-tuning, and building LLMs, with demonstrable resultsBuilding Advanced (not plain-vanilla) Retrieval-Augmented Generation (RAG)Information Retrieval, Embedders and Re-rankers training and finetuningSense of Data quality and Data-centric approaches is a plusExperience in model optimization, serving and scalability is a plusExperience in OCR and its advanced techniques is a plusFamiliarity with cloud platforms like AWS, GCP, or Azure is a plusMust be Humble, Excellent, Relevant with a high sense of OwnershipYou will be at the forefront of an exciting time for the Middle East, joining a high-growth rocket-ship in an exciting space.You will be given a lot of responsibility and trust. We believe that the best results come when the people responsible for a function are given the freedom to do what they think is best.The fundamentals will be taken care of : competitive compensation, top-tier health insurance, and an enabling culture so that you can focus on what you do best.You will enjoy a fun and dynamic workplace working alongside some of the greatest minds in Al.We believe strength lies in difference, embracing all for who they are and empowered to be the best version of themselves.#J-18808-Ljbffr