Sample Data
Several data files are provided below and are available to download for free and to import into Vesta.
1. Breast Cancer in lower-peninsula Michigan (Shapefile and *.csv)
This folder contains Shapefile system data files and a *.csv data file.
The data columns in the Shapefile (*.shp) are:
- FIPS - this is the name of the identifier used in this example. The FIPS code is used to numerically identify a county in the US, in this example only lower-peninsula Michigan counties are used.
- County - The name of the Michigan county
- Poverty - Socioeconomical status is labeled as poverty (level) which comes from the census bureau
- Distance - Access to health care is measured by driving distance from residence of patient to the nearest provider location
The data columns in the *.csv file are:
- FIP - this is the FIPS code for each respective county
- Year - this is the time the data was collected
- Rate - this is the measured cancer rate
- Stddev - standard deviation is calculated from the total number of cases detected
Click the link Breast Cancer Data Shapefile and *.csv to download the data to your machine.
2. Point data of Flint (*.csv)
This data file contains point data of levels of lead and copper sampled in Flint, Michigan. It is a *.csv file. The data columns are:
- ID - used to keep the data organized geographically. It is not related to any geographic identifier.
- Lat - the latitude of each data point
- Long - the longitude of each data point
- Lead_ppb - lead measured in parts per billion at each sample location
- Copper_ppb - copper measured in parts per billion at each sample location
- Log(Pb) - the logarithm of the lead measurement
- Log(Cu) - the logarithm of the copper measurement
Click the link Point Data of Flint (*.csv) to download the data to your machine.
3. Point data of Flint (Shapefile)
This data file contains point data of levels of lead and copper sampled in Flint, Michigan. It is a Shapefile system. The data columns are:
- ID - used to keep the data organized geographically. It is not related to any geographic identifier.
- Lat - the latitude of each data point
- Long - the longitude of each data point
- Lead_ppb - lead measured in parts per billion at each sample location
- Copper_ppb - copper measured in parts per billion at each sample location
- Log(Pb) - the logarithm of the lead measurement
- Log(Cu) - the logarithm of the copper measurement
Click the link Point Data of Flint (Shapefile) to download the data to your machine.
4. Polygon data of poverty in Flint (Shapefile)
This data file contains poverty level information from Flint, Michigan. It is a Shapefile system. The data columns are:
- ID - used to keep the data organized geographically. It is not related to any geographic identifier.
- Object ID - the geographic identifier that can be used to pair with merged data from another data file.
- Start Time - sample time that is useful when using BioMedware software SpaceStat. Not utilized by Vesta.
- End Time - sample time that is useful when using BioMedware software SpaceStat. Not utilized by Vesta.
- Name - The number associated with the same of the county.
- Poverty - the poverty level measured at each geographic location.
Click the link Polygon data of Flint Poverty (Shapefile) to download the data to your machine.
5. Arsenic contamination (Shapefile and *.csv)
This folder contains Shapefile system data files and a *.csv data file.
The data columns in the Shapefile (*.shp) are:
- ID - used to keep the data organized geographically. It is not related to any geographic identifier.
- Object ID - the geographic identifier that can be used to pair with merged data from another data file.
- Start Time - sample time that is useful when using BioMedware software SpaceStat. Not utilized by Vesta.
- End Time - sample time that is useful when using BioMedware software SpaceStat. Not utilized by Vesta.
- STATEFP - code for state of Michigan
- COUNTYFP - code used for Michigan county
- GEOID - geographic identifier for city in a Michigan county
- NAME - name of city
The data columns in the *.csv file are:
- ID - used to keep the data organized geographically. It is not related to any geographic identifier.
- Longitude - the longitude of each data point
- Latitude - the latitude of each data point
- arsenic (ppb) - concentration of arsenic measured in parts per billion
This folder also contains a project file (Arsenic_tutorial.gmd) that can be directly opened in Vesta.
Click the link Arsenic contamination Shapefile and *.csv to download the data to your machine.