Skip to content
View tenapril's full-sized avatar
😎
Data Plumber with interest in IOT and Pipelining
😎
Data Plumber with interest in IOT and Pipelining

Block or report tenapril

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tenapril/README.md

πŸ‘‹ Hello, I'm Andrew Tirto Kusumo!

Online Portofolio CV

A Data Engineer with a passion for making teams work faster and better by building efficient data pipelines and creating scalable data solutions. I enjoy analyzing and wrangling data to drive actionable insights, while constantly improving processes and tools.

With extensive experience across various technologies such as Google BigQuery, DBT, Apache Beam, AWS, GCP, Docker, Airflow, and more, I’ve developed streaming pipelines from scratch, optimized data warehouses, and led engineering teams to success.

πŸ› οΈ Skills & Expertise

  • Data Engineering: ELT/ETL pipelines, real-time streaming, data warehousing, data modeling
  • Cloud Platforms: Google Cloud, AWS, Alibaba, Azure Machine Learning
  • Tools & Technologies: DBT, Apache Beam, Airflow, Docker, Datastream, Pub/Sub
  • Languages: Python, SQL, R, Java, C++, PHP, Assembly
  • Machine Learning: Feature engineering, model deployment, predictive modeling
  • Leadership: Team management, career development, hiring

πŸ“Š Projects & Achievements

Flip.id (2021 - Present) | Data Engineer Manager

  • Developed a real-time streaming pipeline from scratch using Google Datastream, Pub/Sub, and Dataflow, enabling seamless data ingestion from application databases to BigQuery.
  • Implemented an end-to-end ELT pipeline with DBT, including testing and query dependencies, reducing BigQuery costs by 20% through optimized partitioning and clustering.
  • Led the Data Engineering team by creating job descriptions, career frameworks, entry tests, and interview processes, successfully hiring a new team from zero.
  • Created a credit scoring proof-of-concept (POC) for Flip’s lending product using Docker and FastAPI.
  • Provisioned Redash and Looker Studio for data analytics and visualization, empowering stakeholders with actionable insights.

JULO (2018 - 2021) | Senior Data Engineer

  • Ensured the smooth operation of a 24/7 Airflow-based data pipeline, resolving issues and ensuring uptime.
  • Designed and implemented a PostgreSQL partitioning strategy for large datasets, significantly improving query performance and scalability.
  • Automated daily Facebook, Google, and Appsflyer API ingestion pipelines, increasing operational efficiency.

PHI-Integration (2017 - 2018) | Data Engineer & Analyst

  • Modeled product prices using SARIMA and linear regression models in Azure Machine Learning.
  • Enhanced location data with Google Maps Geocoding and visualized event data for a Marathon Event.
  • Developed a news crawler to track and analyze media trends using R.

🌟 Open Source Contributions

  • KUACI: Developed an open-source KYC solution for Indonesian KTP data, translating KTP numbers into location, gender, and DOB.
  • Contributed to GitHub Arctic Code Vault as part of the KUACI project.

πŸ… Education & Certifications

  • Bachelor of Computer Science from Multimedia Nusantara University, GPA: 3.78/4.0
  • Thesis: Speech Recognition Analysis using CNN for Indonesian Language
  • Certifications: TOEIC (950/990), Japanese JLPT N3

πŸ† Honors & Awards

  • Academic Scholarship: Awarded to the top 5% of students at Multimedia Nusantara University.

πŸš€ Want to Collaborate?

I’m always open to new challenges and collaboration opportunities. Let’s build something amazing together!

πŸ“§ andrewtirtokusumo@gmail.com | 🌐 LinkedIn | πŸ‘¨β€πŸ’» GitHub


Top Languages

Pinned Loading

  1. kuaci kuaci Public

    Indonesian KTP Validator + Enrichment [Open Source]

    Jupyter Notebook 27 4

  2. kamus-geografi-speech-recog-tensorflow kamus-geografi-speech-recog-tensorflow Public

    kamus geografi dengan tensorflow

    Python

  3. Automated-Warehouse-Arduino Automated-Warehouse-Arduino Public

    Automated Warehouse using Arduino

    Arduino

  4. apache/airflow apache/airflow Public

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

    Python 37.1k 14.3k

  5. Postgres Partitioned Table Monthly S... Postgres Partitioned Table Monthly Script DDL Generator
    1
    ## Postgres Partitioned Table Monthly Script DDL Generator
    2
    USING PYTHON3
    3
    1) this script was created based on [this](https://www.postgresql.org/docs/10/ddl-partitioning.html)
    4
    2) writing a DDL for every single table is time consuming so i created this script, this script's purpose is to create a monthly postgres partitioned table based on the date
    5
    3) this will create a partitioned table with name like child_table_y2020m01