
Senior Site Reliability Engineer (SRE), Argo Watch

Company: Argo AI
Location: Palo Alto
Posted on: May 14, 2022

Job Description:

Who We Are

Argo AI is a global self-driving products and services company on a mission to make the world's streets and roadways safe, accessible, and useful for all. Our technology is built to enable commercial services for autonomous delivery and ridesharing in cities. With experienced leaders in the field and collaborative partnerships with some of the world's top consumer brands, we're working block by block, city by city to empower people and businesses to be more successful. We're individuals driven by strong values to solve complex problems together. Come join us to reimagine the human journey.

Meet The Team

At Argo AI, the Argo Watch team is building the next generation of cloud and AV software that monitors, manages, and maintains every autonomous vehicle across multiple metros in the US and Germany. More specifically, we are responsible for:

- Transportation as a Service (TaaS) APIs for partner engagements
- Mission Control for all autonomous vehicles
- Remote Operations, including remote guidance and remote troubleshooting for autonomous vehicles
- Ride-hailing mobile apps for iOS and Android, showcasing our core product end-to-end

As a Site Reliability Engineer on the team, you will be responsible for helping to build and run these mission-critical systems. Through the implementation of monitoring and automation, you will constantly ensure the health, reliability, scalability, and performance of the service.

What You'll Do

- Design and implement scalable distributed systems to facilitate the development of self-driving vehicles
- Monitor and maintain mission-critical production services to ensure maximum uptime
- Document actions to build a comprehensive library of runbooks, which will act as a knowledge base and foundation for automation
- Scale the reliability and velocity of our systems and processes through increased automation
- Participate in an on-call rotation and culture of continuous improvement through blameless postmortems

What You'll Need To Succeed

- Degree in Computer Engineering, Computer Science, Electrical Engineering, Robotics, or a related field
- Fundamental understanding of Linux operating system internals, TCP/IP networking, and storage subsystems
- Track record of scaling and securing services in the cloud (AWS, GCP) or cloud-native environments
- Driven to leverage infrastructure-as-code principles to automate the creation of infrastructure resources (e.g. Terraform, CloudFormation)
- Production experience with some of the following: Postgres, Redis, Elasticsearch, and ActiveMQ
- Experience authoring and maintaining Kotlin, Java, and C++ codebases
- Experience running and observing microservices in a production environment
- Understanding of engineering design limitations and ability to provide guidance to teams to scale their services to achieve desired performance within budget
- A focus on increasing service reliability through defining and adhering to SLOs
- Strong communication skills and the ability to work effectively in a diverse and distributed team

What We Offer You

- High-quality individual and family medical, dental, and vision insurance
- Competitive compensation packages
- Employer-matched 401(k) retirement plan with immediate vesting
- Employer-paid group term life insurance and the option to elect voluntary life insurance
- Paid parental leave
- Adoption & Surrogacy Assistance Program
- Paid medical leave
- 30-day paid sabbatical upon 5 years of employment
- Unlimited vacation
- Complimentary daily lunches, beverages, and snacks
- Pre-tax commuter benefits
- Monthly wellness stipend
- Professional development reimbursement
- Employee assistance program
- Discounted programs that include legal services, identity theft protection, pet insurance, and more
- Company and team bonding outlets: employee resource groups, quarterly team activity stipend, and wellness initiative

Our Background

Argo AI was founded in 2016 by industry experts with extensive experience building robotic systems for commercial applications. Our once-small team has since grown into a company more than 1,700 people strong, with strategic partnerships with some of the world's leading consumer brands. With global headquarters in Pittsburgh, we operate in eight cities across the U.S. and Germany in areas where self-driving technology can have the biggest impact on improving safety, traffic, and transportation equity.

At Argo AI, we believe that embracing differences delivers superior results. We are an equal opportunity employer that is committed to an inclusive environment for all employees.

Keywords: Argo AI, Palo Alto, Senior Site Reliability Engineer (SRE), Argo Watch, Engineering, Palo Alto, California

