
Sergio Sánchez Zavala
Senior Data Engineer & AI Operations Leader @ TalkingPoints
Founder of @tacosdedatos
TalkingPoints
tacosdedatos
LoQueAndoOyendo
Biography
My name is Sergio Sánchez Zavala and I’m originally from Tijuana, Baja California, México. I’m a Senior Data Engineering Leader with 7+ years architecting enterprise-scale data infrastructure and pioneering AI-powered operations. I’m dedicated to making research transparent and reproducible while transforming data capabilities at mission-driven organizations.
I’m passionate about building data platforms that scale, implementing AI/ML solutions that drive real impact, and establishing data governance frameworks that enable organizational strategy and innovation.
I’m also:
- 🌮📊 the founder of @tacosdedatos - tacosdedatos.com
- the premier Spanish-language data science community with 1,100+ active members, 10K Twitter followers, and 20K TikTok followers
- 🧑🏼🔬🎨 a senior data engineer at TalkingPoints - where I lead data infrastructure powering multilingual communication for 5M+ students, families, and educators
Interests
- Data Visualization
- Social Tech
- Public Policy
Education
-
B.A. in Economics, 2016
University of California, Davis
-
B.A. in International Relations, 2016
University of California, Davis
Experience
Senior Data Engineer
TalkingPoints
- Scaled Data Infrastructure: Designed and optimized Snowflake + dbt pipelines processing 1.5B+ messages, enabling near real-time analytics and reporting for product and business teams
- Pioneered AI/ML Capabilities: Built and deployed production NLP pipelines for sentiment analysis, intent recognition, and entity extraction; implemented CLIO-inspired conversation segmentation and topic clustering delivering actionable insights
- Drove Strategic Impact: Partnered with product, research, and operations teams to deliver data products that inform strategy, measure program impact, and improve student outcomes
- Established Data Excellence: Implemented data governance standards and documentation practices, reducing ad-hoc data requests by 40% while increasing organizational trust in metrics
- Optimized Operations: Led performance tuning initiatives that reduced Snowflake costs by 70%, accelerating decision-making while saving significant resources
- Championed AI Operations: Advocated for and built AI-Ops initiatives, integrating LLMs into internal workflows to boost cross-team productivity and automation
Data Engineer
TalkingPoints
- Developed initial NLP pipelines for message analysis and sentiment extraction
- Collaborated with product teams to establish key metrics and reporting frameworks
- Built data infrastructure supporting platform growth and user analytics
- Earned promotion to Senior Data Engineer in recognition of technical leadership and impact
Data Engineer
Alluma
- Modernized Data Architecture: Led adoption of modern data models and architecture patterns, developing and managing Snowflake data warehouse and Airflow data pipelines
- Enabled Self-Service Analytics: Designed and built dashboards and self-service tools in various BI technologies for internal teams and client-facing solutions
- Drove Data Culture: Conducted technical training on data concepts and tools; developed best practices for data analysis, reporting, and visualization
- Cross-Functional Leadership: Collaborated with functional area leaders to identify opportunities for operational efficiency and data-driven transformation
Data Visualization Analyst
Alluma
- Developed ETL jobs and tests to process, validate, and distribute data across multiple systems
- Created comprehensive documentation and style guides for data visualization best practices
- Built client-facing analytics solutions and internal operational dashboards
- Conducted technical training on data concepts and visualization tools
Founder
tacosdedatos
- Research, write and publish content as well as reach out to potential content creators.
- Create marketing materials for our content and push them on to the right channels
- Maintain the website t acosdedatos.com/
Research Associate
Public Policy Institute of California
- Conducted advanced statistical analyses on immigration and education policy impacts
- Published peer-reviewed research reaching national audiences
- Presented technical workshops at major conferences (PyCon US 2019/2020, NACIS)
- Developed open-source tools for census data analysis and geospatial policy research
Data Analyst
Davis Joint Unified School District
Notable Achievements & Leadership
Community Building & Thought Leadership
Founder, tacosdedatos.com
Built premier Spanish-language data science community reaching 1,100+ active members, 10K Twitter followers, and 20K TikTok followers
Open Source Impact
Created Datawrapper Python SDK, pypums (Census data access), and spotify-to-sqlite - tools actively used by data teams at major news organizations and research institutions
Conference Speaking
PyCon US 2024
Ingenería de datos para mi salud mental (Data Engineering for my Mental Health) - Spanish-language talk
PyCon US 2020
Geospatial Public Policy Analysis with GeoPandas - exploring how to apply pandas power to geospatial data
PyCon US 2019
Analyzing Census Data with Pandas - tutorial on working with Census data from IPUMS using pandas
NACIS 2019
Making Maps with Python - open source tools in Python ecosystem for geographic data visualization
Data Day Mexico 2022
¿Qué es mejor que una buena fuente de datos? Dos buenas fuentes de datos (What's better than one data source? Two data sources)
Publications & Research
Immigrants in California
Co-authored research publication - Public Policy Institute of California (2019)
Consistently Beautiful Visualizations with Altair Themes
Technical article - Towards Data Science (2018)
Multiple Peer-Reviewed Publications
Research on education policy and economic mobility
Explore projects
Skills
Data Platform Architecture
Snowflake, dbt, Airflow, ETL/ELT design, real-time streaming
AI/ML Operations
NLP pipelines, LLM integration, sentiment analysis, topic modeling
Programming
Python (Advanced), SQL (Advanced), JavaScript, R
Cloud & DevOps
AWS, GCP, Azure, Docker, CI/CD
Machine Learning
scikit-learn, spaCy, Hugging Face, conversation segmentation
Analytics & Visualization
Tableau, Jupyter, pandas, Altair, d3.js, Datawrapper
Data Governance
Data modeling, documentation, quality frameworks
Certifications
Snowflake Advanced Architecture (2021), Snowflake Data Engineering (2021)