Skip to content

Performance and Capacity Engineer
Company | Meta |
---|
Location | Menlo Park, CA, USA |
---|
Salary | $258524 – $290180 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s |
---|
Experience Level | Mid Level, Senior |
---|
Requirements
- Requires a Bachelor’s degree in Computer Science, Engineering, Applied Sciences, Applied Mathematics, Physics or a related field. Foreign Degree equivalent accepted.
- Requires completion of a university-level course, research project, internship, or thesis in the following:
- 1. Python, C, C++, C #, or Java
- 2. Linux or Unix as evidenced by file manipulation, advanced commands, and shell scripting
- 3. Software development tools: Compilers, and revision control systems
- 4. ML frameworks such as PyTorch or Caffe2
- 5. Technical presentation skills
- 6. Computer architecture and microarchitecture
- 7. Performance analysis, tuning and optimization
- 8. Machine Learning/Deep Learning and Recommendation Systems
Responsibilities
- Scaling the largest web capacity in the world.
- Work with Product Engineering, Infrastructure Engineering, and Data Engineering teams to find the optimal way to scale the infrastructure, which encompasses tens of billions of user requests, hundreds of peta bytes of data, and thousands of giga bps of network flow.
- Own end-to-end product design, launch, and operation.
- Support architecture design, define networking requirements, and help code build from scratch to support new product launch.
- Tackle the state-of-the-art hardware performance issues as well as analyze and debug difficult server performance issues (latest in industry), identify bottlenecks, and optimize product/service performance to improve user experience.
- Solve the hardest software performance issues by working with software developers to improve code base performance (e.g. algorithm redesign), and reduce resource consumption and shorten request latency.
- Plan the largest server and datacenter capacity and own and drive overall Meta capacity planning work for all different products/services and recommend DC expansion plan.
- Develop tools to monitor billions of user requests by writing monitoring, reporting, data-mining tools to do performance and capacity-related tests and analysis.
- Provide the deepest visibility to what is going on for all products.
- Run capacity and performance experiments to determine scaling and utilization parameters for various service tiers.
- Own company server budget and track it.
- Present performance and capacity roadmaps for critical projects and cost analyses in presentations and written for monthly to executive teams.
- Collaborate with financial analysts, operations and engineering to perform cutting-edge technologies investigation and cost analysis.
- Identify capacity-related issues proactively and work with systems, network, application operations and engineering teams to discover resolutions.
Preferred Qualifications
No preferred qualifications provided.