Housing Study:
Examining Homelessness and Housing Intervention in New York State and New York City
This project performs quantitative and qualitative analysis of housing data that is availability for New York State and New York City. The study was done at the beginning of 2024. Quantitative data was obtained from the federal government's Department of Housing and Urban Development (HUD), and qualitative data was obtained from peer-reviewed articles and current news events. The below PDF is a presentation I gave about the results I found and conclusions I drew from the data.
Dataspeak:
Creating a Chatbot Using a LLama2 Large Language Model
This project is for an externship to create a customer service chatbot. It is trained on data from Stack Overflow, specifically questions about Python. It uses models from SentenceTransformer and Llama2, then leverages ConversationalRetrievalChain in order to generate Question/Answer responses. The UI was built using Chainlit and deployed initially on Google Colab. After that, I created a Docker image and then deployed on AWS.
Interconnect:
A Comprehensive Project to Recommend Customer Retention Strategies
This project provides exploratory and statistical data analysis and evaluates multiple machine learning models, both with and without gradient boosting, in order to make recommendations to a media company about how to best predict customer churn and make plans to retain customers.
Rusty Bargain:
Gradient Boosting Models
This project evaluates gradient boosting machine learning models to make a recommendation on the best model to make predictions on the market value of used cars.
Sure Tomorrow Insurance:
Linear Algebra, Nearest Neighbors, and Data Masking

This project utilizes linear algebra and machine learning to find customers similar to given customers, as well as make predictions about the benefits that a new customer is likely to receive.
Good Seed:
The Use of Computer Vision to Prevent Underage Sales

This project uses Computer Vision to determine if a grocery store can figure out age by photo at a self checkout for restricted items.
Oily Giant:
Machine Learning using Bootstrapping

This project predicts the profits for the top 200 wells in a region that is most profitable for a mining company.
Zyfra Gold Recovery:
Data Analysis and Machine Learning Models

This project examines different machine learning models to evaluate the best one to predict the maximum ore extracted in the ore purification process.
Film Junkie Union:
Natural Language Models

This project evaluates different language learning models based on IMDB reviews and then uses a set of written reviews to compare the trained classification models.
Sweet Lift Taxi:
Machine Learning and Time Series

This project evaluates different machine learning models on a time series to make predictions on how to maximize the number of taxi orders in an hour.
Megaline New Plan Choice:
Machine Learning and Classification

This project creates a machine learning model to predict how legacy customers of a communications company will choose the newest subscription model to roll over to.
Beta Bank:
Machine Learning fine-tuned by F1 Score

This project creates a machine learning model that is fine-tuned by evaluating F1 score for a bank to help it predict which customers are likely to leave in the near future in order to prevent customer churn.
Zuber:
Data Analysis Using SQL and Python

This project performs data preprocessing and exploratory and statistical data analysis after SQL queries narrowed down the scope of the data to the correct date and locations in order to help a ride-sharing application make decisions about its market share.
Megaline:
Data Analysis to Determine Customer Plan Optimization

This project contains an exploratory and statistical data analysis of consumer data for a communications company that provides calling, texting, and internet services to customers.
Ice Games:
Exploratory and Statistical Data Analysis to Determine Game Sales

This project contains an exploratory and statistical data analysis of consumer data for a communications company that provides calling, texting, and internet services to customers.