Data Scientist - Generative AI
Location
Pune | India
Job description
What You'll Be Doing:
Key Requirements of a Data Scientist - Generative AI
- 5+ years of Industry experience primarily related to Unstructured Text Data and NLP (PhD work and internships will be considered if they are related to unstructured text in lieu of industry experience but not more than 2 years will be accounted towards industry experience) and 6 months+ Generative AI experience.
- Hands-on experience on Data Science, NLP, LLM, Generative AI, Deep Learning
- Healthcare domain experience is mandatory
- Generative AI certification, course work and evidence of relevant GenAI/Deep Learning courses available in the market (courses completion from any renowned LLM companies NVIDIA, Cohere, Predibase, Pinecone, Microsoft, Google, databricks)
- Understanding of NLP concepts like sequence tagging, POS, sentiment analysis, machine translation, summarization
- You have solid understanding of deep learning architecture like encoder-decoder (T5), encoder only transformers (BERT family of models), decoder only transformers, recurrent neural network, seq2seq models, LSTM, GPT, VAE, and GANs.
- You are proficient in understanding Transformers, Word Embedding, Positional Encoding, Attention models and strong knowledge of maths/algebra to understand deep knowledge on latest transformers technique.
- You are proficient in Python and have experience with machine learning libraries and frameworks such as TensorFlow, PyTorch, or Keras.
- You have strong knowledge of data structures, algorithms, and software engineering principles.
- You are familiar with cloud-based platforms and services, such as AWS, GCP, or Azure.
- You have experience with natural language processing (NLP) techniques and tools, such as SpaCy, NLTK, or Hugging Face.
- You have knowledge of software development methodologies, such as Agile or Scrum.
- You possess excellent problem-solving skills, with the ability to think critically and creatively to develop innovative AI solutions.
- You have strong communication skills, with the ability to effectively convey complex technical concepts to a diverse audience.
- You possess a proactive mindset, with the ability to work independently and collaboratively in a fast-paced, dynamic environment.
- Experience with developing and deploying products in production with experience in two or more of the following languages (Python, C++, Java, Scala)
Academic Qualifications:
- Master's degree or above in Computer Science, Computational Linguistics, Mathematics, Physics, or Electrical Engineering with research experience from a strong academic program along with thesis
- Completion of thesis/research is required as part of graduation in computer science, artificial intelligence, Mathematics, Physics, Electrical Engineering or statistics.
Nice to Have
- A PhD degree in Computer Science, Artificial Intelligence, Computational Linguistics, Machine Learning, or related technical field from a strong academic program
- Medical concepts with codes from standard ontologies (SNOMED CT, LOINC, RxNorm, ICD, etc.)
- Experience with Kubernetes and dockers
- Experience building REST API's for AI work and knowledge of microservices architecture
- Participation in open source community projects
- Publication record in top NLP conferences (NIPS, ICLR, ACL, NAACL, EMNLP, SIGIR, etc.)
Job tags
Salary