As an Analytics Engineer at Salmon, you will play a pivotal role in data modeling and transformation across the Databricks silver and gold layers. You will work closely with Data Scientists, Engineers, and Business System Analysts to ensure that datasets align with business needs.
Key responsibilities
Data Modeling & Transformation
- Design, build, and maintain scalable data models in Databricks silver (curated data) and gold (business-ready data) layers.
- Define clear data contracts between silver and gold to ensure consistency and reliability.
- Apply best practices for dimensional modeling (star/snowflake schemas) to support analytics and reporting; see the sketch after this list.
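For illustration only, a minimal sketch of what a gold-layer star schema might look like in Databricks SQL. The schema, table, and column names (gold.dim_customer, gold.fct_orders, and so on) are hypothetical, not part of Salmon's actual model.

```sql
-- Hypothetical gold-layer dimension table (business-ready, conformed from silver)
CREATE TABLE IF NOT EXISTS gold.dim_customer (
  customer_sk   BIGINT GENERATED ALWAYS AS IDENTITY,  -- surrogate key
  customer_id   STRING NOT NULL,                      -- natural key carried over from silver
  customer_name STRING,
  segment       STRING,
  valid_from    TIMESTAMP,                            -- SCD2 validity window
  valid_to      TIMESTAMP,
  is_current    BOOLEAN
) USING DELTA;

-- Hypothetical fact table keyed to the dimension by surrogate key
CREATE TABLE IF NOT EXISTS gold.fct_orders (
  order_id      STRING NOT NULL,
  customer_sk   BIGINT,
  order_date    DATE,
  order_amount  DECIMAL(18, 2)
) USING DELTA
PARTITIONED BY (order_date);
```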
Collaboration & Best Practices
- Partner with data scientists, platform engineers, and business analysts to ensure gold datasets meet business needs.
- Follow software engineering practices: version control (Git), CI/CD for data pipelines, code reviews, and testing.
- Contribute to the development of a shared analytics engineering framework (naming standards, reusable templates, testing frameworks).
ETL/ELT Development
- Develop and optimize transformation pipelines (PySpark/SQL/Delta Live Tables/Databricks Workflows) to process data from bronze to silver to gold.
- Implement incremental data processing strategies to minimize compute cost and improve pipeline performance; a sketch of one such pattern follows this list.
- Ensure data quality checks (validations, anomaly detection, deduplication, SCD handling, etc.) are built into transformations.
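As a rough sketch of what an incremental silver-to-gold load can look like in Databricks SQL, under stated assumptions: the table names reuse the hypothetical ones above, and the watermark handling is simplified to a literal rather than real pipeline state.

```sql
-- Hypothetical incremental upsert: only rows updated since the last load
-- are merged from the silver table into the gold fact table.
MERGE INTO gold.fct_orders AS tgt
USING (
  SELECT order_id, customer_sk, order_date, order_amount
  FROM silver.orders
  WHERE _updated_at > '2024-01-01T00:00:00'  -- placeholder watermark; in practice read from pipeline state
) AS src
ON tgt.order_id = src.order_id
WHEN MATCHED THEN UPDATE SET *
WHEN NOT MATCHED THEN INSERT *;
```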
Data Quality & Governance
- Establish and maintain data quality metrics (completeness, accuracy, timeliness) for silver and gold tables; an illustrative check is sketched after this list.
- Apply data governance standards: consistent naming conventions, documentation, and tagging across datasets.
- Collaborate with data platform engineers to enforce lineage and observability.
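One possible way to express such checks in Databricks SQL, shown only as a sketch; the constraint, tables, and columns are hypothetical and carried over from the earlier examples.

```sql
-- Hypothetical declarative rule enforced at write time on a silver table
ALTER TABLE silver.orders ADD CONSTRAINT order_amount_non_negative CHECK (order_amount >= 0);

-- Hypothetical completeness and timeliness metrics for a gold table
SELECT
  COUNT(*)                                                  AS row_count,
  AVG(CASE WHEN customer_sk IS NULL THEN 1.0 ELSE 0.0 END)  AS null_customer_ratio,  -- completeness
  MAX(order_date)                                           AS latest_order_date     -- timeliness
FROM gold.fct_orders;
```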
Business Enablement
- Work closely with analysts and business stakeholders to understand requirements and translate them into gold-layer datasets.
- Build reusable, business-friendly datasets that power dashboards, self-service BI tools, and advanced analytics.
- Maintain documentation (data dictionaries, transformation logic, lineage diagrams).
Performance & Optimization
- Optimize Databricks SQL queries and Delta Lake performance (Z-ordering, clustering, partitioning); see the example after this list.
- Monitor and tune workloads to control compute spend on silver and gold pipelines.
- Implement best practices for caching, indexing, and incremental updates.
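A small, hedged example of layout maintenance on a large gold table; the table and columns are the hypothetical ones used above, and the right Z-order keys depend entirely on actual query patterns.

```sql
-- Compact small files and co-locate rows that are frequently filtered together
-- (Z-order keys must not be partition columns, so order_date is excluded here)
OPTIMIZE gold.fct_orders
ZORDER BY (customer_sk);

-- Remove data files no longer referenced by the table (default retention applies)
VACUUM gold.fct_orders;
```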
Requirements and expectations
- Strong SQL expertise
  - Ability to write complex, performant queries (CTEs, window functions, joins); an illustrative query follows this block
  - Experience optimizing queries on large datasets
  - Strong understanding of analytical SQL patterns
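Purely as an illustration of the kind of analytical SQL pattern in scope (the tables and columns are the same hypothetical ones as above):

```sql
-- Hypothetical example: rank each customer's orders by recency and keep the latest one
WITH ranked_orders AS (
  SELECT
    d.customer_id,
    f.order_id,
    f.order_amount,
    ROW_NUMBER() OVER (PARTITION BY d.customer_id ORDER BY f.order_date DESC) AS rn
  FROM gold.fct_orders f
  JOIN gold.dim_customer d
    ON f.customer_sk = d.customer_sk
)
SELECT customer_id, order_id, order_amount
FROM ranked_orders
WHERE rn = 1;
```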
- Hands-on experience with dbt
  - Building and maintaining dbt models (staging, intermediate, marts); a minimal model is sketched after this block
  - Writing reusable macros and Jinja templates
  - Implementing tests, documentation, and exposures
  - Working with dbt version control and CI workflows
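A minimal sketch of an incremental dbt mart model with a Jinja-templated filter; the model name, upstream ref (stg_orders), and columns are assumptions for illustration, not an actual project layout.

```sql
-- models/marts/fct_orders.sql (hypothetical dbt model)
{{ config(materialized='incremental', unique_key='order_id') }}

select
    order_id,
    customer_id,
    order_date,
    order_amount
from {{ ref('stg_orders') }}

{% if is_incremental() %}
  -- on incremental runs, only process rows newer than what the target table already holds
  where order_date > (select max(order_date) from {{ this }})
{% endif %}
```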
- Data Modeling expertise
  - Strong understanding of dimensional modeling (facts, dimensions, star schemas)
  - Ability to translate business requirements into scalable data models
  - Designing metrics and semantic layers for analytics and BI
  - Experience maintaining a single source of truth for business metrics
- Analytics Engineering mindset
  - Strong focus on data quality, reliability, and consistency
  - Experience working closely with analysts and business stakeholders
  - Ability to balance technical best practices with business needs
- Production-ready analytics
  - Experience with data testing, monitoring, and debugging
  - Familiarity with ELT pipelines and modern data stack concepts
  - Comfortable working in Git-based workflows