Epicareer Might not Working Properly
Learn More

Data Engineer

Salary undisclosed

Apply on


Original
Simplified

As a Data Engineer, you will be a vital part of our team, working closely with Back-End Developers, Front-End Developers, Product Owner, Quality Assurance, and Project Manager to support our Healthcare Management System (SIM-RS) project. Your main responsibilities will include designing and implementing ETL pipelines, managing data governance and quality standards, and ensuring optimized performance of our database systems. You'll play a key role in integrating our data infrastructure into the application stack, contributing to the technical documentation and providing monitoring for data quality.

Responsibilities:

  • Design and implement ETL pipeline services tailored for healthcare data needs
  • Establish and maintain data governance and quality standards across the project
  • Develop and optimize database queries to improve data processing performance
  • Create and manage monitoring systems to track data pipeline health and efficiency
  • Maintain a comprehensive data dictionary and relevant technical documentation
  • Respond to data requests, meeting deadlines for delivery as per project needs
  • Integrate ETL services with the existing application stack for seamless data flow
  • Implement robust error handling and recovery procedures within data processes

Requirements:

  • Minimum of 3 years of experience in a Data Engineering role or related field
  • Strong expertise with MongoDB and PostgreSQL for data management
  • Proficiency in Python for ETL development and data processing
  • Hands-on experience in designing and implementing ETL pipelines
  • Familiarity with Docker and Git for development and deployment processes
  • Skilled in database optimization and performance tuning
  • Experience with API development for data integration

Preferred Qualifications:

  • Previous experience in healthcare data systems or projects
  • Familiarity with Node.js and Vue.js for system integration tasks
  • Knowledge of Apache Airflow for workflow automation
  • Experience implementing data governance frameworks
  • Familiarity with monitoring tools like Prometheus and Grafana for system health checks

Expected Deliverables:

  • A fully functional and efficient ETL pipeline system
  • A framework for monitoring and maintaining data quality
  • Implemented database performance optimization solutions
  • Clear and organized technical documentation
  • Successful integration of data systems with current application