Careers at Facebook

Software Engineering

Software Engineer, Data Infrastructure (Giraph team)

Location: Menlo Park, CA
Facebook was built to help people connect and share, and over the last decade our tools have played a critical part in changing how people around the world communicate with one another. With over a billion people using the service and more than fifty offices around the globe, a career at Facebook offers countless ways to make an impact in a fast-growing organization.
Are you passionate about building systems for large-scale data processing and analytics?

As an experienced engineer on the Analytics Infrastructure team at Facebook, you will work on our platforms for graph and iterative/ML algorithms that operate at massive scale.

The ideal candidate is interested in working on both the algorithms and the platform: working closely with our product teams to understand their use cases and build new applications such as collaborative filtering, clustering, and machine learning algorithms, while also driving the platform forward so that it scales as existing applications grow and new ones are added. If you frequently think to yourself, 'Yeah, that works for 100 billion edges; but what about a trillion edges or more?', come join us!

Giraph is used extensively at Facebook to power a variety of large-scale graph and iterative algorithms in our products. Here is a blog post on our work: https://www.facebook.com/notes/facebook-engineering/scaling-apache-giraph-to-a-trillion-edges/10151617006153920

This position is full-time and based in our main office in Menlo Park, CA.

Responsibilities

  • Develop platforms for the management and manipulation of massive data sets and graphs (hundreds of terabytes of data)
  • Analyze and improve efficiency, scalability, and stability of the analytics platform
  • Work closely with product teams to understand their analytics use cases and construct efficient algorithms for them
  • Optimize our infrastructure at both the software and hardware level

Requirements

  • Extensive experience working with large-scale data processing
  • Understanding of graph/iterative algorithms and analytics platforms (Giraph, MLBase, GraphX, GraphLab)
  • Expert knowledge of developing and debugging in Java
  • 5+ years of relevant experience with filesystems, server architectures and/or distributed systems
EOE Minorities/Females/Protected Veterans/Individuals with a disability.