- Strong sense of ownership, customer service, and integrity demonstrated through clear communication
- Deep understanding of Linux and large-scale system administration
- Deep understanding of AWS services including EKS, ECS, MSK
- Coding experience in a high-level programming language such as Python or Golang
- Experience building and managing infrastructure in AWS using Terraform
- Experience running Docker-based workloads in production
- Experience with Kubernetes is a plus, but not required
- Keep the lights on: on-call duty and alert handling
- Manage new build-outs (additions and decommissions)
- Develop and maintain scripts used for environment monitoring and task automation (Python, Ansible, Puppet)
- Experience setting up and managing monitoring tools such as Graphite, Prometheus, InfluxDB, Grafana
- Set priorities and work efficiently in a fast-paced environment
- Measure and optimize system performance
- Demonstrate ability to deliver results on time with high quality
- Experience with Spinnaker is a plus
- Opportunity to work on bleeding-edge projects
- Work with a highly motivated and dedicated team
- Competitive salary
- Flexible schedule
- Benefits package: medical insurance, sports
- Corporate social events
- Professional development opportunities
- Well-equipped office
Site Reliability Engineer - Guadalajara, México - Grid Dynamics
Description
Summary
The Site Reliability Engineer (SRE) position requires a mix of strategic engineering and design along with hands-on technical work. An ideal candidate will have experience building and managing infrastructure in AWS, and the coding skills to automate tasks and build tools that support our service operations. The SRE will configure, tune, and troubleshoot multi-tiered systems to achieve optimal application performance, stability, and availability. The SRE will work closely with software, infrastructure, and network engineers to deploy and maintain our services.
Key Qualifications
The successful candidate will be highly self-motivated, with a passion for excellence, quality, and attention to detail.
Responsibilities of the SRE include the following
Grid Dynamics is a leading provider of technology consulting, agile co-creation, scalable engineering and data science services for Fortune 500 corporations undergoing digital transformation.
We work in close collaboration with our clients on digital transformation initiatives that span strategy consulting, early prototypes and enterprise-scale delivery of new digital platforms. We help organizations become more agile and create innovative digital products and experiences using deep expertise in emerging technology, top global engineering talent, lean software development practices, and high-performance product culture.
Headquartered in Silicon Valley with over 1,300 technologists located in engineering delivery centers throughout the US, Central and Eastern Europe, Grid Dynamics has architected and delivered some of the most extensive digital transformation programs in the retail, technology and financial sectors to help its clients win market share, shorten time to market and reduce costs of digital operations on a massive scale.
To learn more about Grid Dynamics, visit , or follow us on Twitter @GridDynamics.