Careers at Facebook

Software Engineering

Performance and Capacity Engineer

Místo konáníMenlo Park, CA
Facebook was built to help people connect and share, and over the last decade our tools have played a critical part in changing how people around the world communicate with one another. With over a billion people using the service and more than fifty offices around the globe, a career at Facebook offers countless ways to make an impact in a fast growing organization.
Facebook is seeking a Software Engineer to join the Capacity Engineering & Analysis team to focus on site-wide performance and capacity projects. This person would be required to work cross-functionally with a number of teams to ensure optimal operation and growth of our computing resources from both a cost and technology perspective. This position is full-time and based at in our Menlo Park office.

Responsibilities

  • Scale the largest web capacity in the world: Work with Product Engineering, Infrastructure Engineering, and Data Engineering team to find the optimal way to scale the infrastructure. Tens of billions of user requests, hundreds of peta bytes of data, thousands of giga bps of network flow. Welcome to the amazingly largest Internet service in the world!
  • Own end-to-end product design, launch, and operation: Support architecture design, define networking requirement, and help code build from scratch to support new product launch.
  • Tackle the state-of-the-art hardware performance issues: Analyze and debug difficult server performance issues (latest in industry), identify bottlenecks and optimize product/service performance to improve user experience.
  • Solve hardest software performance issues: Work with software developers closely to improve code base performance (e.g. algorithm redesign), reduce resource consumption and shorten request latency.
  • Plan the largest server and datacenter capacity: Own and drive overall Facebook capacity planning work for all different products/services and recommend DC expansion plan.
  • Develop coolest tools to monitor billions of user requests: Write monitoring, reporting, data-mining tools to do performance and capacity-related tests and analysis.
  • Provide deepest visibility to what is going on for all products: Run capacity and performance experiments to determine scaling and utilization parameters for various service tiers.
  • Own company server budget and track it: Present performance and capacity roadmap for critical project and cost analyses in presentation and written form monthly to executive teams.
  • Find the game changers and bring them on: Work with financial analysts, operations and engineering to perform cutting-edge technologies investigation and cost analysis.
  • A lot of other cool work: Identify capacity-related issues proactively and work with systems, network, application operations and engineering teams to discover resolutions.

Requirements

  • MS degree in Computer Science, Electrical Engineering, Operations Research or other technical field preferred
  • Experience working in extremely large-scale computing infrastructures
  • Strong coding and scripting ability (C, C++, Java, php, python or perl)
  • Solid database experience, MySQL preferred
  • Experience with hadoop a plus
  • Strong analytical, budgeting and planning skills
  • Excellent interpersonal and communication skills
  • Proven ability to work effectively across multiple teams
EOE Minorities/Females/Protected Veterans/Individuals with a disability.
Použít nyní