Project Description
This supplement describes the data provided for your group project. These are real-world datasets. To
protect the data provider’s proprietary information, the structure of these datasets, the locations of the
tanks therein, and the invoices have been obfuscated so as not to reflect the real information of the
data provider.
The datasets cover more than a year of fuel purchases (by the gas station owners) and fuel sales at all
gas stations in the city.
Data Dictionary
Locations.csv
This dataset lists all the gas station locations and contains the following columns:
• Gas Station Location: The unique ID of the gas station
• Gas Station Name: The gas station name
• Gas Station Address: The gas station address
• Gas Station Latitude: The gas station latitude
• Gas Station Longitude: The gas station longitude
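As a minimal sketch of how this file might be read, the following uses Python's standard csv module to index stations by their unique ID. It assumes the CSV header row uses exactly the column names listed above, which should be checked against the actual file. The function name load_locations and the sample rows are hypothetical placeholders, not values from the real data.

```python
import csv
import io

# Made-up placeholder rows for illustration; the real Locations.csv differs.
# Assumption: the header row matches the column names in the data dictionary.
SAMPLE_LOCATIONS = """Gas Station Location,Gas Station Name,Gas Station Address,Gas Station Latitude,Gas Station Longitude
1,Example Station A,1 Example St,43.10,-79.20
2,Example Station B,2 Example Ave,43.30,-79.40
"""


def load_locations(handle):
    """Return {location ID: row dict}, with coordinates parsed as floats."""
    locations = {}
    for row in csv.DictReader(handle):
        row["Gas Station Latitude"] = float(row["Gas Station Latitude"])
        row["Gas Station Longitude"] = float(row["Gas Station Longitude"])
        # IDs are kept as strings, exactly as they appear in the file.
        locations[row["Gas Station Location"]] = row
    return locations


locations = load_locations(io.StringIO(SAMPLE_LOCATIONS))
print(locations["1"]["Gas Station Name"])
```

To read the real file, the same function can be given an open file handle, for example open("Locations.csv", newline="").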
Tanks.csv
Each gas station location may have more than one tank. This dataset contains information about these
tanks and has the following columns:
• Tank ID: A unique ID of each tank in the system
• Tank Location: The gas station location where this tank is installed
• Tank Number: The number identifying the tank within its gas station location
• Tank Type: The type of fuel this tank is used for: U for regular gas, D for diesel, and P for
premium gas. You can consider U and P as Gas.
• Tank Capacity: Capacity of the tank in liters
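Because a location can have several tanks, a common first step is to total the storage capacity per location and tank type. The sketch below shows one way to do this with the standard library. It assumes the header row matches the column names above; the function name capacity_by_location_and_type and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict

# Made-up placeholder rows for illustration; the real Tanks.csv differs.
# Assumption: the header row matches the column names in the data dictionary.
SAMPLE_TANKS = """Tank ID,Tank Location,Tank Number,Tank Type,Tank Capacity
T1,1,1,U,40000
T2,1,2,U,30000
T3,1,3,D,25000
T4,2,1,P,20000
"""


def capacity_by_location_and_type(handle):
    """Sum Tank Capacity (liters) for each (Tank Location, Tank Type) pair."""
    totals = defaultdict(float)
    for row in csv.DictReader(handle):
        key = (row["Tank Location"], row["Tank Type"])
        totals[key] += float(row["Tank Capacity"])
    return dict(totals)


capacity = capacity_by_location_and_type(io.StringIO(SAMPLE_TANKS))
for (location, tank_type), liters in sorted(capacity.items()):
    print(location, tank_type, liters)
```

The result is keyed by the raw Tank Type codes (U, D, P); any grouping of codes into broader fuel categories would be a separate step.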
Invoices.csv
Each gas station purchases different fuel types from its supplier(s). Each delivery of a given fuel type
to a location generates one invoice, which covers all of that location's tanks receiving that fuel. The
Invoices.csv dataset contains information about these invoices over time and has the following columns:
• Invoice Date: Date of the purchase
• Invoice ID: Unique ID of the invoice
• Invoice Gas Station Location: Gas station location
• Gross Purchase Cost: Total amount paid for the purchase, in Canadian dollars (CAD)
• Amount Purchased: Total volume of fuel purchased, in liters
• Fuel Type: Purchased fuel type
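Since each invoice records both a total cost and a total volume, a per-liter purchase price can be derived by dividing one by the other. The sketch below illustrates this under stated assumptions: the header row matches the column names above, and the sample dates, IDs, and Fuel Type codes are made-up placeholders, because the actual date format and fuel codes in Invoices.csv are not specified here. The function name unit_cost_per_invoice is hypothetical.

```python
import csv
import io

# Made-up placeholder rows for illustration; the real Invoices.csv differs.
# Assumptions: header names match the data dictionary; the date format and
# the Fuel Type codes shown here are placeholders only.
SAMPLE_INVOICES = """Invoice Date,Invoice ID,Invoice Gas Station Location,Gross Purchase Cost,Amount Purchased,Fuel Type
2000-01-01,INV1,1,10000.00,8000,G
2000-01-02,INV2,1,6000.00,4000,D
2000-01-03,INV3,2,0.00,0,G
"""


def unit_cost_per_invoice(handle):
    """Return {Invoice ID: CAD per liter}, or None when no liters were bought."""
    unit_costs = {}
    for row in csv.DictReader(handle):
        cost = float(row["Gross Purchase Cost"])
        liters = float(row["Amount Purchased"])
        # Guard against division by zero for empty or placeholder invoices.
        unit_costs[row["Invoice ID"]] = cost / liters if liters > 0 else None
    return unit_costs


unit_costs = unit_cost_per_invoice(io.StringIO(SAMPLE_INVOICES))
print(unit_costs)
```

Invoices with a zero purchased amount are mapped to None rather than dropped, so they stay visible during data cleaning.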