Role Location
- San Francisco, CA, United States
Employees
Address
Tech Stack
- Python
- Tensorflow
- OpenCV
- Cython
- JavaScript
- React
- React Native
- Flask
- Rust
- Lustre
Role Description
Responsibilities: Build observability into products to measure and monitor availability, performance and overall health of systems Engage in the development cycle by consulting on system design, building software frameworks, testing and tooling Monitor deployed hardware for imminent failure cases. Improve product life cycle through design, deployment, operation, maintenance and refinement Leverage observability to improve operations and drive development towards increased reliability, performance and accuracy Identify and integrate with third-party solutions when applicable. Scale development processes and deployments through automation and orchestration Build systems that scale across thousands of physical locations Lead incident response strategy and postmortems.
Requirements: Rust Deep desire and ability to automate routine tasks, debug and optimize software Expertise with Unix operating systems and system administration Experience with configuration management tools, distributed systems, cloud/on-prem orchestration and provisioning Familiarity with open-source and third-party observability tools. Incident response and management experience. Practical experience with tools such as Prometheus, Grafana, Fluentd, ELK stack, etc
Nice to have: Python and Bash Expertise in designing, analyzing and troubleshooting large-scale distributed systems. Experience working with Mesos or another cluster scheduler / resource manager Comfortable configuring and deploying Nix/NixOS based systems Linux tracing tools such as lttng, dtrace, perf, etc Experience doing ops in bandwidth constrained environments
About Standard Cognition
We build a machine vision solution that allows automatic and instant checkout in stores. Shoppers go to a store, grab what they want, and then leave. No lines and no scanning.
Company Culture
We value independence, candor, and grit. We're solving hard problem and we value people that love taking ownership and drilling into finding a solution. We also value and encourage failure. Most approaches to the problems we work on will fail. Sharing those failures is an important way to move the entire team forward. Because of that we value people's implementation of approaches, not whether those approaches pay off.
Address
Tech Stack
- Python
- Tensorflow
- OpenCV
- Cython
- JavaScript
- React
- React Native
- Flask
- Rust
- Lustre
Skip straight to final-round interviews by applying through Triplebyte.