This dashboard provides an overview of the Year-To-Date (YTD) and historical performance of the Nasdaq Futures Index and the Nasdaq-100 stocks. It was created as part of an end-to-end data pipeline project, leveraging Python, Dagster, dbt, and Quarto.
Read MoreIn the pursuit to expand my skills and knowledge in Azure, I embarked on the Azure Cloud Resume Challenge. I first discovered this challenge through Gwyn’s video and I knew right away it’d be a fun project with tremendous learning opportunities. Despite having zero experience with Terraform and having only basic knowledge of GitHub Actions and Azure I was optimistic about using my programming and process automation skills to take an infrastructure as code approach to this challenge.
Read MoreThis project was created to discover high volume options contracts within the first thirty minutes of market open. It scrapes data from the Chicago Board Options Exchange (CBOE) to find potential profitable options based on some basic filters. This is a rudimentary scanner that only identifies weekly options.
Read MoreThis post documents the steps taken for the setup and configuration of my Azure Databricks workspace. The objective was to create a cost-efficient and versatile workspace in Databricks, aimed at providing a sandbox environment for data engineering experiments.
Read MoreI discovered this fantastic post written by Claire Carroll and I was instantly inspired. I really enjoyed studying her process and figured that it’d be a fun exercise to implement this data transformation in R. I’m not going to convert the SQL code from Claire’s post to R code verbatim. Instead, I’ll implement my own process for the data transformation. Overall, the input and output data will be the same as in Claire’s post.
Read MoreThe option_transactions table shows options trades executed on the thinkorswim trading platform. Query the table to calculate the profit/loss made on each trade.
Read MoreThe orders table shows the number of products ordered each day. Query the data to return the most frequent item(s) ordered each day. Return multiple items if there are ties.
Read MoreThis post focuses on exploring the 2019 crime report profiles of Florida’s counties. The goal is to visualizing the similarities and differences in the overall crime report profiles between counties.
Read More
The data used for this analysis was obtained from the Journal of Statistical Education Data Archive. It consists of a sample of 654 subjects, male and female, aged 3 to 19 years old from the area of East Boston during the late 1970s. This analysis attempts to determine the relationship between forced expiratory volume (FEV) and smoking status via linear regression.
Read MoreThis post will be the first in a series of posts demonstrating how I approach data wrangling problems. As an analyst in a clinical research setting I spend a great majority of my time cleaning and transforming data for analysis and thought it’d be a good idea to start documenting my approach to various types of data.
Read MoreAs part of the final project for my Data Science For Clinical Research class in graduate school I needed to create a machine learning model capable of detecting asthmatic subjects. The purpose of this project was not to create a production ready model using state-of-the art machine learning algorithms but to understand the fundamentals of data science.
Read More