The ideal candidate is a self-starter and problem-solver who successfully combines technology and data into best-in-class outcomes.
The candidate is energized by solving complex business problems and is consistently effective at making high-judgment decisions at a rapid pace amid the frequent ambiguity that comes with charting a course of action with no precedent.
Moreover, the ideal candidate is energized by an environment where strategy, innovation, and decision-making are intentionally distributed, where candor, speed, and data are highly valued, and where colleagues at all levels hold each other to unusually high standards on behalf of Quince customers.
Responsibilities
You will be hands-on and participate in technical deep-dives to resolve issues and obstacles preventing the team from delivering high-quality output.
Build and test code continuously with scripting and programming languages.
Manage, track, and document changes to code with source control tools.
Deploy applications via automation with configuration management tools.
Measure the performance and environment of applications with system and application log tools.
Build and maintain high-availability and disaster management strategies.
Optimize the total cost of ownership (TCO) of infrastructure.
Understand instructions from the infrastructure head and execute plans or tasks accordingly.
Handle complex architecture and respond to support requests in a timely manner.
Requirements
4-7 years of relevant reliability engineering experience at an online technology company.
Hands-on experience as a Site Reliability Engineer, Platform Engineer, or DevOps Engineer.
Experience with scripting and programming languages such as Bash, Go, Java, JavaScript, Perl, Python, and Ruby.
AWS (must have) and Azure (good to have), including IAM, EC2, VPC, ELB, ALB, Auto Scaling, and Lambda.
Good understanding of object-oriented programming, relational databases, NoSQL, and caching systems.
Experience with open-source databases such as Cassandra, CockroachDB, CouchDB, PostgreSQL, MongoDB, and MySQL.
Experience with monitoring tools such as CloudWatch, Datadog, PagerDuty, Sentry, and Sumo Logic.
Configuration management systems such as Ansible.
Load balancers and reverse proxies such as Nginx and HAProxy.
Source code management and implementation of security best practices.
Experience building monitoring, metrics, and alerting tools (APM) and custom dashboards for each application stack in each supported environment.
BS degree in Computer Science or a related engineering discipline.