Job Description
This role focuses on providing technical support for the implementation of data strategies, with a specific emphasis on optimizing data infrastructure and workflows. The candidate will be responsible for designing, managing, and maintaining data warehouse systems to ensure efficient data processing and cost-effective solutions. Key responsibilities include the integration of diverse data sources into cloud-based data warehouses, the development and execution of data pipelines, and the continuous improvement of data quality and system performance. The position requires collaboration with cross-functional teams to align data strategies with business objectives and ensure compliance with data governance standards.
Key Responsibilities
- Implementing best practices in data warehouse design and management to reduce complexity and costs, leveraging tools such as DBT, SQL, Python, Airflow, and GitHub on the Google Cloud Platform.
- Designing and executing data ingestion processes to transfer data from various sources into the cloud data warehouse (BigQuery), ensuring data accuracy and timeliness.
- Supporting the integration of multiple data sources, including Google Analytics, transactional databases, SFTP servers, and public datasets accessed via APIs and webhooks, into BigQuery for centralized data management.
- Optimizing data warehouse performance through query tuning, schema design, and resource allocation, while monitoring data quality and implementing corrective measures.
- Establishing and maintaining data governance frameworks to ensure data integrity, security, and compliance with regulatory requirements.
- Collaborating with stakeholders to identify data needs, document processes, and provide technical guidance for data strategy implementation.
- Developing and maintaining documentation for data workflows, integration processes, and system configurations to support team knowledge sharing and future scalability.
- Participating in the evaluation of new tools and technologies to enhance data processing capabilities and improve operational efficiency.
Job Requirements
- Proven experience in data strategy implementation, with a strong background in data warehouse design and management on cloud platforms like Google Cloud Platform (GCP).
- Expertise in utilizing DBT (Data Build Tool), SQL, Python, Airflow, and GitHub for data transformation, automation, and version control in a cloud environment.
- Strong proficiency in working with BigQuery for data storage, querying, and analysis, including knowledge of its integration capabilities with external data sources.
- Experience in integrating diverse data sources such as Google Analytics, transactional databases, SFTP servers, and public datasets via APIs and webhooks.
- Technical skills in optimizing data warehouse performance, monitoring data quality, and implementing solutions to resolve data-related issues.
- Knowledge of data governance principles, including data classification, access control, and compliance with data regulations.
- Excellent communication and collaboration skills to work with cross-functional teams, including data engineers, analysts, and business stakeholders.
- Ability to document complex data workflows and system configurations clearly for team reference and audit purposes.
- Strong problem-solving abilities and attention to detail to ensure accurate and reliable data processing outcomes.
- Preferred qualifications include certifications in cloud computing (e.g., Google Cloud Professional Data Engineer), data engineering, or related fields, as well as experience with data pipeline development and cloud-based analytics tools.