• Site Reliability Engineer

    Job Location: US-WA-Kent
    Category: Enterprise Technology
    Job ID: 2688
  • Overview

    As part of a small, passionate and accomplished team of experts, you will provide the data necessary to design, build and launch rockets faster, cheaper and with continually higher quality. We accomplish this by building state-of-the-art software and analyzing data to uncover patterns for quick decision making.


    We design systems that track millions of physical parts and complex manufacturing activities in remote locations. We build systems that process massive amounts of data and engineering tools that enable rapid design and iteration. We are seeking team members of all backgrounds who are passionate about space and who have a strong desire to serve on a team that is the backbone of the company.


    As a Site Reliability Engineer, you will work on rewarding problems and interesting technologies. You will implement the infrastructure that allows for rapid development and iteration of software throughout the company, including distributed systems and embedded software on board our rockets and space vehicles. You will make decisions and implement systems that affect the productivity of thousands of rocket scientists and engineers throughout the company.


    What makes our SREs successful?



    • Technical breadth and depth with a strong understanding of emerging trends

    • A strong bias for automating everything

    • Humility and the willingness to operate in unfamiliar domains

    • A strong “customer first” personality and desire to be a subject matter expert

    Responsibilities


    • Engage in and improve the whole lifecycle of software – from inception and design, through deployment, operation, and refinement

    • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.

    • Maintain software once it is live by measuring and monitoring availability, latency and overall system health.

    • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.

    • Practice sustainable incident response and blameless postmortems.

    • Configure, deploy, scale, and administer open source and commercial software

    Qualifications


    • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.

    • Understanding of and experience with modern software development practices

    • Interest in analyzing and troubleshooting distributed systems.

    • Ability to debug and optimize code and automate routine tasks.

    • Experience in one or more of the following: C, C++, Java or Python

    • Must be a U.S. citizen or permanent resident (current Green Card holder)

    Desired


    • Experience with the Atlassian suite of products, including JIRA and Confluence

    • Experience with relational or non-relational databases, including configuring, deploying, scaling, and troubleshooting

    • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
