About

Hello! My name is Colin Suarez. I am a recent graduate from Saint Mary’s College of California where I received my M.S. in Business Analytics degree (Cohort 12, Dec 2022). I currently work as a business analyst at a chiropractic office but I am open to new opportunities.

I decided to make this career path change because I was reflecting on my undergraduate studies when I was at the University of San Francisco where I received my B.A. in Sociology with an emphasis in Criminology, Law, and Society. I started my undergraduate studies as a Computer Science major where I was able to begin learning Python and Java. I decided to switch career paths to try to become a lawyer which led me to switching majors. As time went on, I realized I enjoyed coding so I began my research on what programs I can get into in order to get back in the field which led me to come across St. Mary’s M.S. in Business Analytics program. During my time at St. Mary’s I felt that I was able to learn and grow as a coder and specifically a data analyst. With that being said, please feel free to check out the projects and coursework I was able to do during my time in this program. Thank you for your time, and hope to connect with you soon!

Projects

Super Store

The SuperStore dataset was found on Kaggle. Using the dataset, I did data cleaning and data exploration to uncover business insights and give business recommendations to help improve and expand business to beecome more profitable. Using data exploration, I was able to create data visualizations which were used as support to answer business questions and also give recommendations.

Skills and Concepts Demonstrated: Python, Pandas, MATPLOTLIB, Data Cleaning, Data Exploration, Data Analysis, Business Insights, Data Visualization

Explore Repository

NASA UAV Obstacle Avoidance Database Capstone Project

There is a growing demand for delivery systems to be more autonomous and efficient, specifically with the use of drones. In order to make drones more autonomous for delivery, we were tasked with the objective to create an obstacle avoidance database. The demand for an obstacle avoidance database has increased because drones can be utilized to make deliveries faster, more efficiently and most importantly autonomously, while taking care so that they do not crash into obstacles and using the most optimized flight path to the destination. Our purpose of creating an obstacle avoidance database is to develop a model that can determine obstacles from LiDAR and satellite imagery through machine learning and feed it into a database as new images of data are updated.

Skills and Concepts Demonstrated: Python, Numpy, Trimesh, Plyfile, LasPy, Pandas, Clustering, Data Segmentation, SQL Database, Google Cloud Virtual Machine, KPConv, TensorFlow, Keras, Machine Learning, Convolutional Neural Network, Data Modeling, YOLOv5, Roboflow, MapBox API, USGS, LiDAR

Project Team Members:
Kevin Brand (kbrand93@gmail.com)
Mark Santos (mksantos6677@gmail.com)
Steven Zhu (stevnzhu@gmail.com)

Explore Repository

Chipotle Marketing Project

In this project, our group was given a survey done by Chipotle. Chipotle wanted to improve customer satisfaction and retention. Our job was to first do data cleaning on the dataset which included removing rows with null values or inserting a median value depending on the completeness of that row, and also fixing any misspellings. Once we were done with the data cleaning process, it was now our job to take the clean data and begin to cluster the dataset using K-Means clustering based on the features and responses given in the survey. Each cluster represented a specific customer profile group. Once the clustering process was completed, we must begin to analyze the clusters and build their customer profile. Once the customer profiles were completed, we then looked for data that stood out to us and we began to find ways to satisfy that customer so that we could increase customer retention and attract customers to Chipotle. The results were delivered via PowerPoint using data Visualization.

Skills and Concepts Demonstrated: Python, NumPy, Pandas, MatPlotLib, K-Means Clustering, Data Cleaning and Exploration, Data Visualization, Microsoft Excel, Pivot Tables, PowerPoint, Feature Importance

Project Team Members:
Melanie He
Mark Santos (mksantos6677@gmail.com)
Steven Zhu (stevnzhu@gmail.com)

Explore Repository

SQL Social Media App Project

In this project, our class was tasked with implementing a SQL database into a real world business problem. Our group decided that we created an imaginary social media app that is similar to Instagram. Using this business situation, we formulated business questions and problems. In order to answer the business questions, we created a SQL database and inserted data into the database. After creating the SQL database, we did a number of SQL queries to answer the business questions and presented our findings with a PowerPoint deck.

Skills and Concepts Demonstrated: SQL, MySQL, Querying, PowerPoint, Data Mapping, Mockaroo for psuedo data, SQL Tables, Business Insights, Decision Making, Data Analysis, Relational Databases

Project Team Members:
Steven Zhu (stevnzhu@gmail.com)

Explore Repository

Machine Learning on Titanic Dataset

In this project our class was tasked with making survival predictions based on the Titanic dataset found on Kaggle. Using the dataset, we trained a logistic regression model that can predict the survivors based on a set of features about the passengers on the Titanic.

Skills and Concepts Demonstrated: Python, Data Cleaning and Exploration, Pandas, Data Visualization, MatPlotLib, Model Building, SkLearn, Logisitic Regression Model

Explore Repository

Iris Dataset Project

The target of the project is to have a hands on full data analytics experience from A to Z. That means starting with raw data and reading it into python then doing some analytical and statistical observations on the data and finally doing some supervised and unsupervised learning on the data on one of the most famous datasets machine learning datasets called the IRIS Classification Dataset.

Skills and Concepts Demonstrated: Python, Pandas, Numpy, MatPlotLib, SkLearn, Scipy, Data Loading and Cleaning, Statistical Exploring and Analysis , Machine Learning, Non-Supervised and Supervised Learning, K-Means Clustering, PCA Reduction

Explore Repository

Certificates

Google Data Analytics (Completed Jan 2023)

I completed a Google Data Analytics 8-course program administered on Coursera. In this program I experienced hands-on, practice-based assessments and are designed to prepare for introductory-level roles in Data Analytics. I used tools and platforms including spreadsheets, SQL, Tableau, and R. I was able to learn how to prepare, process, analyze, and share data for thoughtful action..

Skills and Concepts Demonstrated: Spreadsheet, Data Cleansing, Data Analysis, Data Visualization, SQL, Questioning, Decion-Making, Problem Solving, Metadata, Data Collection, Data Ethics, Sample Size Determination.

View Certificate

Accenture Data Analytics and Data Visualization Virtual Experience Program on Forage (Completed Jan 2023)

During this virtual internship experience I was able to learn about project understanding, data cleaning and modeling, data visualization and storytelling and lastly, presenting to a client. I was able to get hands on experience of what it is like to received a project from a company, then doing data exploration and analysis to create a presentation to give insights and recommendations on how to solve the business problem.

Skills and Concepts Demonstrated: Teamwork, Strategy, Project Planning, Data Understanding, Data Modeling, Data Analysis, Storytelling, Data Visualization, Presentations, Communication, Public Speaking, Programming Skills (Python), PowerPoint Presentation, Microsoft Excel.

Explore Repository