about me


Hi! I'm Shefali, an MS in Applied Statistics candidate at Columbia University, set to graduate in December 2024. I have 5+ years of experience developing data-driven solutions in interdisciplinary fields including data science and analytics, consulting, education, and tech. Throughout my academic and professional journey, I've gained hands-on experience with the full data science lifecycle, from building ETL pipelines and training ML models to creating interactive visualizations that translate complex data into actionable insights for both technical and non-technical audiences.

As a graduate student, I've built on my industry experience, deepening and diversifying my technical skillset. My coursework includes statistical analysis and inference, applied machine learning, computational statistics, and big data tools such as Apache Spark and Hadoop. Beyond academics, I've gained practical experience with industry-standard tools such as Docker, Airflow, and Git.

As my experience and education show, I am excited about leveraging data to solve interesting problems and am always eager to take on new challenges in the ever-evolving field of data science and machine learning. If you'd like to discuss potential collaborations or job opportunities, or just chat about new research papers over coffee, please visit the 'contact' section.



tech stack

Python

R

SQL

LlamaIndex

PyTorch

Scikit-learn

Hugging Face

XGBoost

Power BI

Google Looker Studio

AWS

Google Cloud

Apache Spark

Apache Airflow

Docker

Git

GitHub

Jupyter

VS Code

RStudio

spaCy

NLTK

Pandas

NumPy

SciPy

tidyverse

Matplotlib

Seaborn

ggplot2

Leaflet

Lattice



get in touch

  • New York, NY, USA
  • shefali.shrivastava.contact@gmail.com