Software Engineer, Machine Learning
- San Francisco, CA, United States
- Toronto, ON, Canada
- Apache Spark
Are you passionate about machine learning and looking for an opportunity to make an impact in healthcare?
Fathom is on a mission to understand and structure the world’s medical data, starting by making sense of the terabytes of clinician notes contained within the electronic health records of health systems.
We are seeking extraordinary Machine Learning Engineers to join our team, developers and scientists who can not only design machine-based systems, but also think creatively about the human interactions necessary to augment and train those systems.
Please note, this position has a minimum requirement of 3+ years of experience. For earlier career candidates, we encourage you to apply to our SF and/or Toronto locations.
As a Machine Learning Engineer you will:
You will develop NLP systems that help us structure and understand biomedical information and patient records
You will work with a variety of structured and unstructured data sources
You will imagine and implement creative data-acquisition and labeling systems, using tools & techniques like crowdsourcing and novel active learning approaches
You will work with the latest NLP approaches (BERT, Transformer)
You will train your models at scale (Horovod, Nvidia v100s)
You will use and iterate on scalable and novel machine learning pipelines (Airflow on Kubernetes)
*You will read and integrate state of the art techniques into Fathom’s ML infrastructure such as Mixed Precision on Transformer networks
We’re looking for teammates who bring:
3+ years of development experience in a company/production setting
Experience with deep learning frameworks like TensorFlow or PyTorch
Industry or academic experience working on a range of ML problems, particularly NLP
Strong software development skills, with a focus on building sound and scalable ML
Excitement about taking ground-breaking technologies and techniques to one of the most important and most archaic industries
A real passion for finding, analyzing, and incorporating the latest research directly into a production environment.
Good intuition for understanding what good research looks like, and where we should focus effort to maximize outcomes.
Bonus points if you have experience with:
Developing and improving core NLP components—not just grabbing things off the shelf
Leading large-scale crowd-sourcing data labeling and acquisition (Amazon Turk, Crowdflower, etc.)
We're building a deep learning system to structure and organize all the free text in patient medical records. Our first application for this system is automatically turning that text into health insurance billing codes. Each year, $3.5 billion is spent on using manual labor to solve this problem with only 83-85% accuracy achieved. Our financing was led by Google Ventures and 8VC and our engineering team comes from Google, Facebook, and Twitch.
We are driven and passionate, motivated by learning, personal growth and impact. We have a heavy emphasis on feedback and communication, running our OKRs on two weeks cycles and monthly one hour check-ins.
- Apache Spark
Skip straight to final-round interviews by applying through Triplebyte.