Senior Site Reliability Engineer (SRE), Argo Watch
Company: Argo AI
Location: Palo Alto
Posted on: May 14, 2022
Job Description:
Who We Are

Argo AI is a global self-driving products and services company on a
mission to make the world's streets and roadways safe, accessible,
and useful for all. Our technology is built to enable commercial
services for autonomous delivery and ridesharing in cities. With
experienced leaders in the field and collaborative partnerships with
some of the world's top consumer brands, we're working block by
block, city by city to empower people and businesses to be more
successful. We're individuals driven by strong values to solve
complex problems together. Come join us to reimagine the human
journey.

Meet The Team

At Argo AI,
the Argo Watch team is building the next generation of cloud and AV
software that monitors, manages, and maintains every autonomous
vehicle across multiple metros in the U.S. and Germany. More
specifically, we are responsible for:

- Transportation as a Service (TaaS) APIs for partner engagements
- Mission Control for all autonomous vehicles
- Remote Operations, including remote guidance and remote
  troubleshooting for autonomous vehicles
- Ride-hailing mobile apps (iOS and Android) showcasing our core
  product end-to-end

As a Site Reliability Engineer on the team, you will be responsible
for helping to build and run these mission-critical systems.
Through the implementation of monitoring and automation, you will
continually ensure the health, reliability, scalability, and
performance of the service.

What You'll Do

- Design and implement scalable distributed systems to facilitate
  the development of self-driving vehicles
- Monitor and maintain mission-critical production services to
  ensure maximum uptime
- Document actions to build a comprehensive library of runbooks,
  which will act as a knowledge base and foundation for automation
- Scale the reliability and velocity of our systems and processes
  through increased automation
- Participate in an on-call rotation and a culture of continuous
  improvement through blameless postmortems

What You'll Need To Succeed

- Degree in Computer Engineering, Computer Science, Electrical
  Engineering, Robotics, or a related field
- Fundamental understanding of Linux operating system internals,
  TCP/IP networking, and storage subsystems
- Track record of scaling and securing services in the cloud (AWS,
  GCP) or cloud-native environments
- Drive to apply infrastructure-as-code principles to automate the
  creation of infrastructure resources (e.g. Terraform,
  CloudFormation)
- Production experience with some of the following: Postgres, Redis,
  Elasticsearch, and ActiveMQ
- Experience authoring and maintaining Kotlin, Java, and C++
  codebases
- Experience running and observing microservices in a production
  environment
- Understanding of engineering design limitations and ability to
  provide guidance to teams to scale their services to achieve
  desired performance within budget
- A focus on increasing service reliability through defining and
  adhering to SLOs
- Strong communication skills and the ability to work effectively in
  a diverse and distributed team

What We Offer You

- High-quality individual and family medical, dental, and vision
  insurance
- Competitive compensation packages
- Employer-matched 401(k) retirement plan with immediate vesting
- Employer-paid group term life insurance and the option to elect
  voluntary life insurance
- Paid parental leave
- Adoption & Surrogacy Assistance Program
- Paid medical leave
- 30-day paid sabbatical upon 5 years of employment
- Unlimited vacation
- Complimentary daily lunches, beverages, and snacks
- Pre-tax commuter benefits
- Monthly wellness stipend
- Professional development reimbursement
- Employee assistance program
- Discounted programs that include legal services, identity theft
  protection, pet insurance, and more
- Company and team bonding outlets: employee resource groups,
  quarterly team activity stipend, and wellness initiative

Our Background

Argo AI was founded in 2016
by industry experts with extensive experience building robotic
systems for commercial applications. Our once-small team has since
grown into a company of more than 1,700 people, with strategic
partnerships with some of the world's leading consumer brands.
With global headquarters in Pittsburgh, we operate in eight cities
across the U.S. and Germany in areas where self-driving technology
can have the biggest impact on improving safety, traffic, and
transportation equity.

At Argo AI, we believe that embracing differences delivers superior
results. We are an equal opportunity employer that is committed to
an inclusive environment for all employees.
Keywords: Argo AI, Palo Alto, Senior Site Reliability Engineer (SRE), Argo Watch, Engineering, Palo Alto, California