Data Engineer

About Healint

Healint is a leading maker of healthcare technology used all over the world. Healint leverages innovative techniques in software, data science and user experience design to empower people to manage their chronic conditions and diseases.

Healint’s first global program - the Migraine Buddy platform and its apps - helps a thriving community of users manage and track their migraines. To date, Migraine Buddy has recorded terabytes of data that helps patients, doctors and researchers better understand the real-world causes and effects of neurological disorders.

We're committed to revolutionizing healthcare technology, and are continually looking to add talented people to the Healint team. We promise challenging problems, an opportunity to have real impact on people's lives, and an environment where you'll learn rapidly from one of the best teams in Singapore.


As a Data Engineer you’ll be working on collecting, storing, processing, and analysing the 250GB (and growing!) of data we receive every week. Your number 1 goal is to help us turn all this data into insights. This also involves helping build machine learning algorithms by preparing and processing training and testing datasets.

We’re expecting a well-rounded profile for this position. You need to feel comfortable being responsible for our analytics infrastructure.

Our current data stack: Redshift/PostgreSQL, Airflow, Python & Tableau

Responsibilities

  • Maintain and improve our Redshift data warehousing system: Databases, ETL/ELT, data streaming system

  • Monitoring data integrity, performance, advising and implementing necessary infrastructure changes

  • Selecting and integrating any Big Data tools and frameworks (EMR Spark, AWS Athena, etc.) required to provide requested capabilities

  • Participating in data product development, with a focus on:

    • The implementation of practical machine learning solutions

    • Bringing data solutions in production (REST API)

Skills

  • 2-3 years experience in software engineering/ data engineering / ops

  • Hands-on working experience with large-scale datasets

  • Databases: Practical knowledge with SQL and no-SQL databases.
    You’re comfortable with querying and writing to databases.

  • Very proficient in Python.

  • Linux sys-admin skills

  • Self-starter, natural planner who looks ahead, raises issues, resolves them and meet deadlines

Pluses

  • Hands-on experience with Machine Learning (classification, clustering)

  • Proficient in a compiled language would be a plus.

  • Familiarity with AWS (DynamoDB, Redshift, S3, EC2, RDS)

  • Understanding of some BI Tools (Tableau, Qlikview, etc.)

  • Experience in creating a REST API that can handle a production load (code + deploy)

*Due to the COVID-19 pandemic, we are not able to consider candidates residing outside of Singapore*