View Our Website View All Jobs

Data Engineer

ThinkCERCA is an education technology startup that helps educators teach critical thinking through argumentative writing. Named a "game-changing" education tool by Bill Gates, our product is proven to help students achieve two years of academic growth per year. We're on a mission to make these results a reality for all students across the country.

We’re looking for someone is who passionate about web development, works well with a team, and is interested in joining a fast-paced startup as a data engineer to drive our product to the next level. Primary skills desired include proficiency with scripting languages, service oriented architecture, Redshift, and Postgres.


What you’ll be doing

  • Building out our data and reporting APIs by expanding our core Redshift database and creating accessible microservices for views into specific slices or targeted transformations.
  • Working with vast swaths of standards-based educational data to improve the critical thinking capacity and writing ability of students in the US and, eventually, throughout the world.
  • Maintaining the absolute privacy and anonymity of any data that you work with or expose to the wider internet.
  • Working with other senior engineers on occasional low level DevOps projects to improve the efficiency of our infrastructure and development process
  • Competing for the coveted Best With Computer award during our bi-monthly hack nights in obscure new programming languages
  • Interfacing directly with our QA team to ensure that the features you build have been rigorously verified and are up to spec
  • Meeting with our stakeholders to understand and refine feature requirements throughout their implementation
  • Making our team better with your unique viewpoints and interests


What you’ll need

  • Experience with or willingness to work in either Ruby or Elixir, which are already in our stack, or one of the following languages, listed in order of preference: Julia, Python, or R. (Bonus points if you’ve worked with any of the machine learning libraries in any of the above languages)
  • At least some basic knowledge of how GraphQL works to help understand the type of data that is sent and received from our main front-end
  • Knowledge of data integrity testing best practices and version control (we use git)
  • An understanding of the many difficulties and necessary tasks in data services that run over the internet, including: handling unnecessary duplication, anonymizing non-private metadata, intermittent connectivity during loading tasks (using transactions, leveraging idempotence), combining data from multiple asynchronous services, preventing vulnerabilities when decrypting and processing encrypted information, etc
  • A desire to continually improve both the stack that we all work in and your own skillset
  • The ability to give and receive thoughtful, constructive feedback on any incoming changes
  • An awareness of when to act, delegate, and ask for help
  • An appreciation of discussion and diverse viewpoints
  • Big Bonus points: knowledge of education or education technology
Read More

Apply for this position

Apply with Indeed
Attach resume as .pdf, .doc, or .docx (limit 5MB) or Paste resume

Paste your resume here or Attach resume file