Senior Computer Vision Researcher
Location
Bangalore | India
Job description
You will bring your experience in vision language model technology to Dolby. As a prerequisite for your work, you have proven knowledge of large language models, vision language models, multi-modality, vision transformer, diffusion model, computer vision, and image/video processing. Your solid understanding of trade-offs between computational complexity and achieved performance of different implementations will guide you in decisions. You will research and develop new neural network architectures, training methods, representation, and processing algorithms on the areas of modern vision language model.
You must have an in-depth understanding of the key processes in vision language model. You have extensive experience in developing architectures, low-level/mid-level/high-level computer vision tasks, and processing algorithms with deep learning to improve performance of vision language model tasks. You will join a highly skilled and motivated team to research and develop advanced computer vision and processing technologies to improve end user experience.
Job Functions - Work closely with other domain experts to refine and execute Dolby s technical strategy in artificial intelligence and machine learning.
- Use deep learning to create new solutions and enhance existing applications.
- Combine domain knowledge with state-of-the-art deep learning techniques that run on a wide range of computing platforms.
- Works with a general understanding of competitive technologies in the area of focus.
- Works to understand the system context in which new technologies will be used and the requirements that the technology must fulfill for success.
- Creates early-stage conceptual models that demonstrate feasibility.
- Document and present the new architectures and algorithms in various forms, such as technical white papers and internal meeting
- Maintain the highest technical and moral integrity for Dolby in the workplace and marketplace.
Skills, Education, and Experience Required
- Ph.D. degree in electrical engineering, computer engineering, or computer science.
- Minimum of 3 years of experience in developing algorithms for machine learning and deep learning, computer vision, and vision-language model.
- Intimately familiar with computer graphics, video processing systems, as well as corresponding software in consumer and professional products.
- Strong theoretical background in AI technologies and a proven ability to apply AI to audio, video, and speech technologies.Proficient in Python (Tensorflow/Pytorch) programming applied to vision language model is required.
- Self-motivated, quick learner and strong team player with ability to work with minimal supervision.
- Excellent written and verbal communication skills.
- It is highly desirable to have one or more of the following:
- Experience in filing patents of newly developed technologies.
- Hands-on experience in applying vision language model algorithms to real-world problems
- Proven track record of successful research accomplishments, published papers, and/or patent applications in the field of vision-language model.
Job tags
Salary