logo

JobNob

Your Career. Our Passion.

Data Engineer - R/Python


Scaling Theory


Location

Bangalore | India


Job description

Requirements : Mandatory Skills : Primary skills (Must Have) : - Azure data factory, azure data lake, java, sql, python, and r.- Deep knowledge of azure data lake, azure data factory, and other azure cloud services.- Proficiency in programming and query languages like python, sql, java, u-sql, and r.- Familiarity with data warehousing concepts, etl processes, and big data technologies.- Skills in data modeling and database systems.- Understanding data security and privacy best practices.- Ability to optimize data queries and etl jobs for performance.- Experience in devops for continuous integration and delivery.- Excellent problem-solving and communication skills to explain complex data processes.- Integration with source systems technical implementation of required pipelines for bronze, silver, gold, and diamond layers.Roles and Responsibilities :1. Data Collection and Integration : Data engineers collect data from various sources, including databases, APIs, external data providers, and streaming sources. They must design and implement efficient data pipelines to ensure a smooth flow of information into the data warehouse or storage system.2. Data Storage and Management : Once the data is collected, data engineers are responsible for its storage and management. This involves choosing appropriate database systems, optimizing data schemas, and ensuring data quality and integrity. They also must consider scalability and performance to handle large volumes of data.3. ETL (Extract, Transform, Load) : Processes ETL is a fundamental process in data engineering. Data engineers design ETL pipelines to transform raw data into a format suitable for analysis. This involves data cleansing, aggregation, and enrichment, ensuring the data is usable for data scientists and analysts.4. Big Data Technologies : In today's data landscape, dealing with big data is the norm rather than the exception. Data engineers work with big data technologies such as Hadoop and Spark to efficiently process and analyze massive datasets.5. NoSQL Databases : In addition to traditional relational databases, data engineers often work with NoSQL databases like MongoDB and Cassandra, which are well-suited for handling unstructured or semi-structured data.6. Cloud Computing : Cloud platforms like AWS, Azure, and Google Cloud have become the backbone of modern data infrastructure. Data engineers leverage these platforms to build scalable and cost-effective data solutions.7. Distributed Systems : Data engineering often involves distributed systems architecture to handle huge data volumes and ensure fault tolerance. Understanding how distributed systems work is essential for data engineers.8. Streaming : Data Real-time data processing is crucial in many industries. Data engineers work with streaming technologies like Apache Kafka to handle and analyze data as it flows in. (ref:hirist.tech)


Job tags



Salary

All rights reserved