logo

JobNob

Your Career. Our Passion.

SRE/Observability Engineer


Snap-on


Location

Markham, ON | Canada


Job description

Who We Are: Since 2007, Dealer-FX has transformed how automotive retailers manage their service operations and interact with consumers. Through advanced data integrations and mobile technology, Dealer-FX streamlines processes and communication for automotive service departments while delivering convenience, transparency, and trust to consumers. Dealer-FX is a wholly owned subsidiary of Snap-on Incorporated (NYSE: SNA), which acquired Dealer-FX in 2021. Dealer-FX has been serving automotive OEMs and dealerships for almost a decade. Dealer-FX is transforming how millions of consumers interact with automotive brands and their retailers. Our platform uses advanced data analysis and mobile applications to deliver convenience, transparency, and trust to consumers and increase efficiency, profitability, retention, and brand loyalty to OEMs and dealers. What We are looking for: At Dealer-FX, we put our users first. The automotive landscape and world are constantly changing, so we need a SRE/Observability Engineer who is continuously adapting and excited to work on products that affect thousands of people daily. We are a team of tech experts who work on AWS-based SaaS solutions for the Automotive Industry. Automotive mobile data and analytics are the core of our business. As a SRE/Observability Engineer, you will have an opportunity to work in our growing Platform Engineering Team (DevOps, SRE and DBA) team to ensure that our expansive suite of products are running optimally in our production environments, and that they are continuously being upgraded in the most efficient way.
What You’ll be Doing: - Lead Deployment Strategy: Spearhead the development and execution of deployment strategies in collaboration with other DevOps team members, ensuring seamless and efficient software releases. - Deployment Optimization: Optimize deployment processes to minimize deployment downtime, reduce risk, and enhance efficiency in collaboration with the DevOps team. - Automation Champion: Identify and implement automation opportunities across deployment, monitoring, and incident management processes to streamline operations and reduce manual intervention. - Proactive Monitoring: Continuously monitor the stability and performance of our production environments, actively identifying potential issues and proactively addressing them to maintain optimal system health. - Daily Health Checks and QA Collaboration: Perform daily health checks and functionality testing of the main components of the system in close collaboration with our QA (Quality Assurance) team. This involves ensuring that critical system functionalities are operating as expected, promptly reporting any anomalies or issues, and working collaboratively to resolve them. - Incident Resolution: Participate in incident resolution by promptly diagnosing and resolving issues related to production environments, ensuring minimal disruption to our users. - On-call Support: Contribute to the on-call rotation schedule to provide for critical systems, ensuring quick response and resolution to any production incidents. What You’ll Bring: - Several years of hands-on experience in a similar role, working extensively with AWS services, Azure DevOps, and DevOps practices. - Demonstrated experience with New Relic and Kibana/Elasticsearch for monitoring, log analysis, performance optimization and observability. - Proficiency in Windows Server administration, especially in the context of IIS. - Hands-on experience with AWS ECS and .NET Core for containerized applications. - Knowledge of Terraform and Ansible for infrastructure provisioning and configuration management. - Familiarity with MS SQL Server and PostgreSQL/MySQL database management. - Experience working with AWS API Gateway and AWS Lambda functions. - Strong knowledge of AWS services, including EC2, ECS, Lambda, API Gateway, and CloudWatch, etc. - Experience with Azure DevOps for managing deployment pipelines, ticketing, version control, and documentation. - Proficiency in managing cloud resources efficiently and cost-effectively. - Strong scripting skills using PowerShell, Bash and Python for automation tasks and deployments. - Experience with infrastructure as code (IaC) using tools like Terraform and Ansible. - Proficiency in using New Relic and Kibana/Elasticsearch for monitoring, - Knowledge of cloud-native monitoring and observability tools like AWS CloudWatch. - Experience with containerization technologies like Docker and container orchestration with AWS ECS and Kubernetes. - In-depth understanding of networking concepts, including TCP/IP, DNS, DHCP, VPNs, and load balancing. - Proficiency in configuring and managing network resources within AWS, including Virtual Private Cloud (VPC) setup, security groups, and network ACLs. - Knowledge of network troubleshooting and packet analysis to diagnose and resolve connectivity issues efficiently. - Strong knowledge of Domain Name System (DNS) configuration and management, both internally and externally. - Experience with domain registration, DNS record management, and troubleshooting DNS-related issues. - Familiarity with Amazon Route 53. - Understanding of network security principles and best practices, including firewalls, intrusion detection/prevention systems, and encryption. - Ability to implement network security measures to protect data and systems. - Knowledge of security best practices and compliance requirements in cloud environments. - Effective collaboration with cross-functional teams, including developers, QA, and system administrators. - Good communication skills to document processes and share knowledge. - Strong analytical and problem-solving skills to diagnose and resolve complex issues. - A commitment to staying up-to-date with emerging technologies and best practices in DevOps and cloud computing. - A willingness to adapt to evolving technologies and the ability to thrive in a dynamic, fast-paced environment.
What’s in it for you? - Vast opportunities for growth - Competitive compensation packages - A flexible work schedule for work-life balance - Comprehensive Training and Development support - Group health and dental benefits - Employee Assistance Program - 3 weeks of paid vacation - Cool company events and team building No unsolicited agency referrals Dealer-FX is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, disability or any other characteristic protected by law. Accommodation is available upon request for applicants with disabilities.


Job tags

Full timeFlexible hours


Salary

All rights reserved