View Our Website View All Jobs

Data Engineer

ThinkCERCA is an education technology startup that helps educators teach critical thinking through argumentative writing. Named a "game-changing" education tool by Bill Gates, our product is proven to help students achieve two years of academic growth per year. We're on a mission to make these results a reality for all students across the country.

We’re looking for someone is who passionate about web development, works well with a team, and is interested in joining a fast-paced startup as a data engineer to drive our product to the next level. Primary skills desired include proficiency with scripting languages, service oriented architecture, Redshift, and Postgres.

 

What you’ll be doing

  • Building out our data and reporting APIs by expanding our core Redshift database and creating accessible microservices for views into specific slices or targeted transformations.
  • Working with vast swaths of standards-based educational data to improve the critical thinking capacity and writing ability of students in the US and, eventually, throughout the world.
  • Maintaining the absolute privacy and anonymity of any data that you work with or expose to the wider internet.
  • Working with other senior engineers on occasional low level DevOps projects to improve the efficiency of our infrastructure and development process
  • Competing for the coveted Best With Computer award during our bi-monthly hack nights in obscure new programming languages
  • Interfacing directly with our QA team to ensure that the features you build have been rigorously verified and are up to spec
  • Meeting with our stakeholders to understand and refine feature requirements throughout their implementation
  • Making our team better with your unique viewpoints and interests

 

What you’ll need

  • Experience with or willingness to work in either Ruby or Elixir, which are already in our stack, or one of the following languages, listed in order of preference: Julia, Python, or R. (Bonus points if you’ve worked with any of the machine learning libraries in any of the above languages)
  • At least some basic knowledge of how GraphQL works to help understand the type of data that is sent and received from our main front-end
  • Knowledge of data integrity testing best practices and version control (we use git)
  • An understanding of the many difficulties and necessary tasks in data services that run over the internet, including: handling unnecessary duplication, anonymizing non-private metadata, intermittent connectivity during loading tasks (using transactions, leveraging idempotence), combining data from multiple asynchronous services, preventing vulnerabilities when decrypting and processing encrypted information, etc
  • A desire to continually improve both the stack that we all work in and your own skillset
  • The ability to give and receive thoughtful, constructive feedback on any incoming changes
  • An awareness of when to act, delegate, and ask for help
  • An appreciation of discussion and diverse viewpoints
  • Big Bonus points: knowledge of education or education technology
Read More

Apply for this position

Required*
Apply with Indeed
Attach resume as .pdf, .doc, or .docx (limit 2MB) or Paste resume

Paste your resume here or Attach resume file

150