Tredence Analytics Solutions Private Limited
Location
Bangalore | India
Job description
Role: Senior Databricks Engineer / Databricks Engineer
Experience : 5-8 years
Location: Bangalore, Chennai, Delhi, Pune, Kolkata
About Tredence:
Tredence is a global data science solutions provider founded in 2013 by Shub Bhowmick, Sumit Mehra, and Shashank Dubey focused on solving the last-mile problem in AI. Headquartered in San Jose, California, the company embraces a vertical-first approach and an outcome-driven mindset to help clients win and accelerate value realization from their analytics investments. The aim is to bridge the gap between insight delivery and value realization by providing customers with a differentiated approach to data and analytics through tailor-made solutions. Tredence is 2200-plus employees strong with offices in San Jose, Foster City, Chicago, London, Toranto, and Bangalore, with the largest companies in retail, CPG, hi-tech, telecom, healthcare, travel, and industrials as clients.
Primary Roles and Responsibilities:
â— Developing Modern Data Warehouse solutions using Databricks and AWS/ Azure Stack
â— Ability to provide solutions that are forward-thinking in data engineering and analytics space
â— Collaborate with DW/BI leads to understand new ETL pipeline development requirements.
â— Triage issues to find gaps in existing pipelines and fix the issues
â— Work with business to understand the need in reporting layer and develop data model to fulfill reporting needs
â— Help joiner team members to resolve issues and technical challenges.
â— Drive technical discussion with client architect and team members
â— Orchestrate the data pipelines in scheduler via Airflow
Skills and Qualifications:
â— Bachelor's and/or master's degree in computer science or equivalent experience.
â— Must have total 6+ yrs. of IT experience and 3+ years' experience in Data warehouse/ETL projects.
â— Deep understanding of Star and Snowflake dimensional modelling.
â— Strong knowledge of Data Management principles
â— Good understanding of Databricks Data & AI platform and Databricks Delta Lake Architecture
â— Should have hands-on experience in SQL, Python and Spark (PySpark)
â— Candidate must have experience in AWS/ Azure stack
â— Desirable to have ETL with batch and streaming (Kinesis).
â— Experience in building ETL / data warehouse transformation processes
â— Experience with Apache Kafka for use with streaming data / event-based data
â— Experience with other Open-Source big data products Hadoop (incl. Hive, Pig, Impala)
â— Experience with Open Source non-relational / NoSQL data repositories (incl. MongoDB, Cassandra, Neo4J)
â— Experience working with structured and unstructured data including imaging & geospatial data.
â— Experience working in a Dev/Ops environment with tools such as Terraform, CircleCI, GIT.
â— Proficiency in RDBMS, complex SQL, PL/SQL, Unix Shell Scripting, performance tuning and troubleshoot
â— Databricks Certified Data Engineer Associate/Professional Certification (Desirable).
â— Comfortable working in a dynamic, fast-paced, innovative environment with several ongoing concurrent projects
â— Should have experience working in Agile methodology
â— Strong verbal and written communication skills.
â— Strong analytical and problem-solving skills with a high attention to detail.
Mandatory Skills: Python/ PySpark / Spark with Azure/ AWS Databricks
Job tags
Salary