DE Jobs

Search from over 2 Million Available Jobs, No Extra Steps, No Extra Forms, Just DirectEmployers

Job Information

Toyota Principal Site Reliability Engineer in Plano, Texas

Excited to grow your career at Toyota?

We value our talented employees, and whenever possible strive to help one of our associates grow professionally before recruiting new talent to our open positions. If you think the open position you see is right for you, we encourage you to apply!

Our people make all the difference in our success .

To save time applying, Toyota does not offer sponsorship of job applicants for employment-based visas or any other work authorization for this position at this time.

Who we’re looking for

Toyota’s Connected Technologies Department is looking for a passionate and highly-motivated Principal Site Reliability Engineer .

The primary responsibility of this role is for building scalable and reliable systems by combining software engineering principles with operational expertise, contributing to the overall stability and success of the connected vehicle infrastructure.

What you’ll be doing

  • Provide leadership and act as technical authority and source of SRE expertise.

  • Lead and contribute to large projects aimed at improving system reliability, scalability & efficiency.

  • Contribute to strategic planning for Reliability Engineering team operations.

  • Collaborate with engineering teams to identify and implement changes to improve observability and troubleshooting tools.

  • Develop and implement standards for adequate observability.

  • Setup, maintain and improve proactive system monitoring capabilities.

  • Act as SME on to support P1/P2 incidents.

  • Work with 3rd party service suppliers to review SLAs and establish measurable SLOs.

  • Propose and implement changes to enhance the overall resilience of the system.

What you bring

  • Extensive experience in system engineering, or DevOps.

  • Proficiency in Java coding, profiling, and source code control tools.

  • Significant experience working in SRE or related roles, demonstrating a deep understanding of reliability engineering principles and practices.

  • Ability to read through system architecture design documents and identify strategies for observability.

  • Proven ability to troubleshoot containerized applications, debug pods, services, and clusters.

  • Proficiency in programming languages commonly used in infrastructure and software development, such as Python, Go, Java, or similar.

  • Advanced knowledge of cloud computing platforms (e.g., AWS, Azure, Google Cloud) and experience with deploying and managing large-scale distributed systems in cloud environments.

  • Demonstrated experience in leading complex technical projects, driving initiatives for improving system reliability, scalability, and performance.

  • Proficiency with Application Performance Monitoring using tool such as DataDog and New Relic.

Added bonus if you have.

  • Bachelors in CS, BA, MIS or equivalent work experience

  • Experience with Firebase Performance Monitoring

  • Google SRE Certification

  • Performance testing experience

  • Familiarity with OpenTelemetry (traces, metrics, logs)

  • Service virtualization tools

What we’ll bring

During your interview process, our team can fill you in on all the details of our industry-leading benefits and career development opportunities. A few highlights include:

  • A work environment built on teamwork, flexibility and respect

  • Professional growth and development programs to help advance your career, as well as tuition reimbursement

  • Vehicle purchase & lease programs

  • Comprehensive health care and wellness plans for your entire family

  • Flextime and virtual work options (if applicable)

  • Toyota 401(k) Savings Plan featuring a company match, as well as an annual retirement contribution from Toyota regardless of whether you contribute

  • Paid holidays and paid time off

  • Referral services related to prenatal services, adoption, child care, schools and more

  • Flexible spending accounts

#ConnectedTechnologies

Job Posting End Date :

at 12AM US/Central

Management Level :

16

DirectEmployers