Corporate Site Reliability Engineer

Tech Ops Team | San Francisco, CA

Dropbox is the home for your most important stuff—now we're bringing it to life with a growing family of products. As we scale our global brand, there’s plenty of space for you to grow alongside us and simplify life for millions of people around the world.

Our tech operations team creates the layer from which all Dropbox experiences are built. As our user base and product family continues to grow, tech operations is responsible for strengthening our foundation.

As a Corporate Site Reliability Engineer at Dropbox, you’ll have the opportunity to define and influence the strategy and technical direction of one of the most exciting technology companies in the world. You will be architecting, creating and delivering redundant, scalable and secure enterprise technology solutions for our internal users. You will be looked upon as an expert and provide technical leadership and guidance to fellow engineers. Your solutions will have a direct impact on the productivity of Dropboxers and help propel the company to greater heights.

The most successful candidates for this role will have strong analytical and troubleshooting skills, fluency in coding or scripting, solid communication skills and a desire to solve complex problems of scale. You should be someone who’s capable of seeing the big picture and driving a project from start to end. We are particularly interested in systems administrators familiar with running web services at scale. Depth in networking technologies and UNIX/Linux internals are strong pluses.

Responsibilities

  • Manage availability, latency, stability and efficiency of Dropbox corporate services
  • Build automation tools to detect, repair and prevent problems
  • Design, review and influence ongoing design, architecture and standards
  • Develop methods for operating services and systems
  • Deploy and maintain web servers, authentication services, directory services, DHCP, DNS, etc.
  • Perform periodic on-call duty

Requirements

  • 6+ years hands-on experience deploying and operating an enterprise class services infrastructure
  • Proven, demonstrable expert-level skills in critical infrastructure services design, implementation and integration in operationally sensitive production environments
  • Experience coordinate or leading small cross-team technical projects
  • Fluency in at least one high-level programming language (Python, Java or similar) and strong scripting skills
  • Experience in OSes and systems (e.g. UNIX internals), load balancing services (DNS, DHCP), storage and clustering
  • Strong knowledge of TCP/IP networking, network and application-level security
  • Strong troubleshooting abilities across all platforms (Mac / Linux / Windows)
  • Experience in management/automation tools such as Puppet/Chef, Vagrant, etc.
  • Experience with managing/automating virtual machines
  • Additional experience strongly preferred:
  • Experience automating Google Apps using tools such as Google Apps Manager (GAM)
  • Familiarity with VOIP, VPN technologies
  • Familiarity with server rooms, UPS, AC, and electrical infrastructure
Back to Tech Ops Team

Other open positions for the Tech Ops Team