Skip to main content
sean@portfolio:~$
$ whoami
Business Analytics Manager & Data Engineer
$ cat interests.txt
Data Engineering, Cloud Analytics, Machine Learning,
Business Intelligence, Predictive Analytics
$ ls skills/
AWS GCP Python PySpark SQL Tableau Airflow TensorFlow Hadoop Jenkins Linux Git
$ _
SZ

Sean Zhang

seanzhangd.com

Business Analytics Manager & Data Engineer

Leading data-driven solutions with 4+ years of experience in cloud analytics, predictive modeling, and team leadership. Specialized in transforming complex data into actionable insights that drive executive decision-making and operational efficiency.

8 Team Members Led
$1M+ Annual Savings
80+ Projects

// Certifications

GCP Professional Data Engineer
GCP Associate Cloud Engineer
AWS Certified Cloud Practitioner
EPIC Certified

// Technical Expertise

☁️

Cloud Platforms

GCP Professional Data Engineer GCP Associate Cloud Engineer AWS Cloud Practitioner BigQuery GCP Dataflow Cloud Storage
🔧

Data Engineering

Python PySpark SQL Airflow Hadoop ETL Automation Linux Git
🤖

Machine Learning & Analytics

TensorFlow Keras scikit-learn NLTK Pandas NumPy R (tidyverse, dplyr, caret) Predictive Analytics
📊

BI & Visualization

Tableau (Server Admin) Alteryx Power BI SAP
🗄️

Databases & Systems

Oracle SQL MSSQL MySQL Teradata EPIC (Cogito, Caboodle, Clarity)
👥

Leadership & Collaboration

Stakeholder Engagement Executive Reporting Cross-functional Collaboration Team Leadership Jenkins CI/CD Server Management

// Professional Experience

Business Analytics Manager

PNC Bank Feb 2025 - Present
  • Lead a team of 8 within a 20+ member cross-country group, overseeing automation, analytics, and predictive reporting for key security domains
  • Manage Information Security Department budget data and executive reporting for CIO and board members
  • Oversee security domains including Data Protection, WIAM, Digital Identity, ASM, and Threat Hunting
Python Tableau PySpark SQL

Business Analytics Lead

PNC Bank Apr 2023 - Jan 2025
  • Developed and managed entire workflow from data engineering to Tableau dashboards providing 30+ real-time KRI metrics
  • Extracted data from 10+ APIs and developed ETL pipelines using Python, PySpark, SQL, and shell scripts
  • Spearheaded automation initiatives saving $1M+ and 10,000+ hours annually, increasing operational efficiency 3-10x
  • Supported ML prediction models improving individual risk management accuracy by 80%
Python PySpark Tableau Jenkins

Advisor, Data Management and Governance

Cardinal Health Apr 2022 - Feb 2023
  • Built and optimized ETL pipelines integrating SQL Server and GCP from Workday, ServiceNow and third-party APIs
  • Led migration of 50+ ETL workflows from GCP SQL instances to BigQuery, doubling read/write speed
  • Administered Tableau Server managing 90+ dashboards for 300+ stakeholders across HR, Finance and Executives
  • Developed predictive modeling for Turnover Risk Prediction serving 10+ C-level executives
GCP BigQuery Python Alteryx

Data Specialist

St. Luke's University Health Network Jul 2021 - Mar 2022
  • Created 40+ reports using SQL and Tableau for clinical operations from Hybrid Clinical Database
  • Integrated Tableau Server, EPIC, SAP into unified reporting system, reducing delivery times by 40%
  • Developed Twilio Call Center reporting system covering 100+ configurations for 300+ agents managing 20,000+ calls daily
  • Certified in EPIC Cogito, Caboodle, Clarity, and Clinical Data model
SQL Tableau EPIC SAP

// Featured Projects

Real-time Security Analytics Platform

Developed comprehensive analytics platform providing 30+ real-time KRI metrics for information security, integrating data from 10+ APIs with automated ETL pipelines and executive dashboards.

Python PySpark Tableau Jenkins

GCP to BigQuery Migration Pipeline

Led migration of 50+ ETL workflows from GCP SQL instances to BigQuery, doubling read/write speed and enabling scalability for 300+ stakeholders across HR, Finance and Executives.

GCP BigQuery Dataflow Alteryx

Predictive Risk Analytics Models

Developed machine learning models for individual risk scoring, insider fraud prediction, and turnover risk prediction, improving risk management accuracy by 80% and serving C-level executives.

Python scikit-learn TensorFlow R

// Education & Certifications

Master of Science in Business Analytics

Washington University in St. Louis January 2021

GPA: 3.97/4.0

Honors: Charles F. Knight Scholar Award (TOP 1%), Beta Gamma Sigma

Activities: Olin Big Data Association, Project Portfolio, Kaggle Competitions

Bachelor of Marketing

Xiamen University June 2019

GPA: 3.61/4.0

Minor: Software Engineering

Exchange: McGill University, Montreal (Full Scholarship - TOP 1%)