Databricks Data Engineer l Contract
reference-number: 159889
industry: Information and Communications Technology
brand-id: R1441955
brand-name: 02C3423
Job Responsibilities
- Design, develop, and maintain scalable data solutions to improve data accessibility, usability, and reliability across the organization.
- Build, optimize, and support end-to-end data pipelines and ETL/ELT processes for data ingestion, transformation, and integration.
- Collaborate with business stakeholders and cross-functional teams to understand data requirements and deliver effective data-driven solutions.
- Query, cleanse, transform, and validate large datasets using SQL, Python, Databricks, and PySpark.
- Monitor, troubleshoot, and optimize data pipelines to ensure high performance, scalability, and operational reliability.
- Conduct regular data quality assessments and implement controls to ensure data accuracy, consistency, and integrity.
- Ensure compliance with data governance standards and relevant government requirements, including IM8 standards where applicable.
- Develop, enhance, and maintain dashboards and reports for operational monitoring, performance tracking, and user behavior analytics.
- Translate business requirements into meaningful visualizations and actionable insights using tools such as Power BI and Tableau.
- Create and maintain comprehensive technical documentation, including data workflows, process documentation, dashboard user guides, and data dictionaries.
- Maintain and update knowledge repositories (e.g., Confluence) to support knowledge sharing and operational continuity.
- Stay current with emerging technologies, industry trends, and best practices in data engineering and analytics, driving continuous process improvements and innovation.
Job Qualification
- Bachelor's Degree in Computer Science, Information Systems, Data Engineering, Data Science, or a related discipline.
- Minimum 5 years of hands-on experience in Data Engineering, including data pipeline development, ETL/ELT processes, and data integration projects.
- Proficiency in SQL and Python for data processing, transformation, automation, and analytics.
- Hands-on experience with Databricks, PySpark, or similar big data and analytics platforms.
- Experience designing, developing, and maintaining scalable and reliable data pipelines.
- Proficiency in dashboard development and data visualization tools such as Power BI, Tableau, or equivalent platforms.
- Good understanding of data quality management, data governance, and data validation practices.
- Experience producing high-quality technical documentation, user guides, and process documentation.
- Familiarity with system testing methodologies, including SIT, UAT, and QA testing.
- Knowledge of Apache Spark and distributed data processing frameworks.
- Working knowledge of Git or other version control systems.
- Demonstrated ability to work independently while collaborating effectively within cross-functional teams.
- Highly organized, detail-oriented, and committed to delivering high-quality solutions in a fast-paced environment.
