Location
Mumbai | India
Job description
RESPONSIBILITIES:
Our company is currently seeking a Data Scientist (NLP/LLM) to join our Equities Quant team in Mumbai. In this role, you will play a crucial part in developing and scaling innovative Natural Language Processing (NLP), Machine Learning (ML), and Deep Learning (DL) algorithms. This is a unique opportunity for someone with solid NLP and tech skills to move into a investment role.
- Design and develop Natural Language Processing (NLP), Machine Learning (ML), and Deep Learning (DL) models to support data-driven business decisions.
- Develop and scale innovative NLP/ML/DL algorithms to extract valuable insights from unstructured data sources.
- Maintain comprehensive documentation of AI models, algorithms, and applications for reference and knowledge sharing.
Additional Responsibilities: - Design and develop custom ML, NLP, and Large Language Models (LLM) for AI/ML pipelines, including data ingestion, preprocessing, search and retrieval, Retrieval Augmented Generation (RAG), and prompt engineering.
- Develop robust evaluation solutions and tools to assess model performance, accuracy, consistency, and reliability during development and UAT.
- Assist in the deployment of machine learning models into production environments, ensuring reliability and scalability.
- Ensure adherence to specified standards, governance, and best practices in ML model development.
- Troubleshoot complex issues related to machine learning model development and data pipelines.
- Stay updated with higher-level trends in Large Language Models (LLMs) and open-source platforms.
- Nice-to-have: Experience with contributing to Github, open-source initiatives, research projects, or Kaggle competitions.
QUALIFICATIONS: Desired Skills And Experience - Bachelor's/Master's/Ph.D. degree in relevant fields like Computer Science, Mathematics, Statistics, Engineering, or Computational Linguistics.
- 4+ years of professional experience leveraging structured and unstructured data for data-driven analytics and insights using ML, NLP, and computer vision solutions.
- Proficiency in Python and experience with tools like Hugging Face, TensorFlow, Keras, PyTorch, and Spark.
- 3+ years of hands-on experience in developing NLP models, ideally with transformer architectures.
- 3+ years of experience in implementing information search and retrieval at scale.
- Knowledge and measurable hands-on experience with developing or tuning Large Language Models (LLM) and Generative AI (GAI).
- Familiarity with open-source platforms and trends in LLMs.
- Strong familiarity with higher-level trends in LLMs and open-source platforms.
- Nice-to-have: Experience with contributing to Github and open-source initiatives or involvement in research projects and Kaggle competitions.
Job tags
Salary