Data Engineer

Remote, Silicon Valley, CA, United States • $112k - $140k

Citrine Informatics

Role Locations

  • Remote
  • Silicon Valley, CA, United States



101 - 250 people


702 Marshall St Ste 520
Redwood City, CA, 94063-1826, US

Tech Stack

  • PostgreSQL
  • DynamoDB
  • AWS
  • JVM
  • JavaScript
  • React
  • Scala
  • Python

Role Description

The Data Engineer will lead efforts to build software tools to bring customer data onto the Citrine platform, validate its accuracy, and analyze its content.

About Citrine

At Citrine, we’re changing the way new materials are developed.

We are the industry leader in materials informatics, the application of data-driven methods to materials and chemicals development. Our platform provides data management and AI tools that help our customers rapidly develop better, more sustainable materials. Our users are scientists and engineers at large manufacturing and materials companies, and researchers at leading universities and government labs.

In 2020 Citrine was recognized for technology innovation by the Global CleanTech Group and was named one of the most promising AI startups by CB Insights. As a team, we are ambitious with our goals, passionate about our vision, and eager to grow and learn from each other. Our team is growing fast and looking for the best to join us.

Though our technology was originally built by materials scientists, our team now consists of professionals trained in a diverse set of fields, including data science, physics, biology, and computer science. We have offices in the San Francisco Bay Area, Chicago, and Pittsburgh, and our customers include Fortune 1000 materials and product companies.

About the Role

Data are the lifeblood of both Citrine and our customers. To our customers, their data represent not only the distilled knowledge of decades’ worth of research, but also the foundation from which they can build artificial intelligence models of materials behavior using the Citrine platform. Materials data come in many forms, and customer data are often messy and heterogeneous. For customers to realize the full value of their data, the data must first be brought onto a common platform, cleaned, and validated. The Data Engineer will have the technical responsibility of writing code to facilitate the structuring, organization, and curation of our customers’ materials data on the Citrine platform. Furthermore, they will build and maintain tools that analyze data files to determine their type, structure, and content, which can be used to help customers understand the tradeoffs between data quality and value.

Responsibilities

  • Engage directly with customers to understand the state of their historical scientific data and their scientific data systems
  • Guide and teach customers on how their data should be integrated into the Citrine platform to maximize its scientific and business value without incurring unnecessary ingestion effort
  • Build software tools to improve, validate, and monitor the data pipeline
  • Prototype data tooling for Citrine's materials science data platform
  • Select, structure, verify, and process large sets of materials data from a variety of sources and formats for inclusion in Citrine's materials science data platform

Skills and Qualifications

  • B.S. degree in the physical sciences (e.g. chemistry, materials science, physics)
  • Strong programming skills in Python (not just scripting)
  • Database querying (SQL, Access, etc.)
  • Must be legally eligible to work in the United States

Preferred Skills and Qualifications

  • Strong Python code development experience, including contributions to open source or collaborative repositories
  • Experience building, scheduling, scaling, and maintaining ETL pipelines
  • Experience with pipeline automation and integration tools (Airflow, Luigi, TIBCO, etc.)

Equal Opportunity

All qualified applicants will receive consideration for employment without regard to race, creed, color, or national origin.

Our Core Values

Citrine Informatics recognizes that its most valuable asset is its people. We have created our set of Core Values to encourage, support, and invest in our team as they work to innovate and support a more sustainable world. Our Core Values reflect our ongoing commitment to continuously invest in nurturing our talent and our people-first approach to conducting business.

  • We take pride in and recognize the successes and growth of ourselves and our colleagues. We support each other in our growth.
  • We prototype and collect data to make good decisions. We question that data and are constantly iterating to find the best solution.
  • We are all owners of Citrine and make decisions like owners. We work autonomously with personal and organizational accountability.
  • We commit to building a diverse and inclusive community within Citrine and actively promote equity and belonging.
  • We are tirelessly committed to creating value for our customers.
  • We exist to help our customers accelerate the development of sustainable products that are critical to the future of both our planet and our industry.

Our Benefits (for exempt, full-time employees based within the United States)

  • 401k with matching up to 4%
  • Medical, vision, dental insurance (we pay 100% of your premium and 75% of your dependents’)
  • Equity options within the company
  • Parental leave
  • Flexible PTO on top of our 15 paid company holidays (includes your birthday!)
  • Free financial counseling
  • $600 tech allowance
  • Monthly $75 phone reimbursement
  • Pre-tax commuter benefits
  • $5,000 annual professional development/growth allowance

About Citrine Informatics

The future of materials development depends on speed. Developing materials faster will require managing and using data more effectively, which includes consolidating data into a single consistent searchable format, as well as structuring, storing, and using materials data to harness the power of artificial intelligence. Our product combines a consolidated materials repository, the world’s largest materials dataset, and powerful artificial intelligence to accelerate materials discovery.

Company Culture

Citrine Informatics’ artificial intelligence technology is changing and accelerating the way materials and products are developed. As a result, the environment is fast-moving, intense, fun, and supportive. The culture of the company is driven by our company values. Candidates who align with the following values will thrive:

  • We act with autonomy and we are accountable for performance excellence;

  • We use data to make good decisions;

  • We live up to our commitments, get the job done, and have a bias toward execution;

  • We are committed to our internal team through recognizing our colleagues for their successes and assistance, and to our external community through making positive contributions when possible;

  • We value diverse opinions, perspectives and backgrounds.

These values influence how we work, prioritize our time, evaluate our performance, and interact with colleagues, customers, and candidates for employment.
