This collection of CSV files contains information from the Internet Movie Database (IMDb; imdb.com), which can be loaded into a database for example problems. The files were assembled to support exercises for the LSSTC DSFP.
The data were collected from the website: https://relational.fit.cvut.cz/dataset/IMDb
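As a rough illustration of packaging CSV files into a database, the sketch below loads one CSV file into a SQLite table using only the Python standard library. The function name load_csv_to_sqlite, the table name, and the file path are hypothetical and not part of the data set; the sketch also assumes each file has a header row and that every row has the same number of fields as the header. All values are stored as text.

```python
import csv
import sqlite3


def load_csv_to_sqlite(conn, table_name, csv_path):
    """Create table_name from the CSV header row and insert every data row as text.

    Hypothetical helper for illustration; assumes a header row is present and
    that each row has exactly as many fields as the header.
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader)
        # Quote identifiers so unusual column or table names do not break the SQL.
        columns = ", ".join('"{}"'.format(name.replace('"', '""')) for name in header)
        placeholders = ", ".join("?" for _ in header)
        quoted_table = '"{}"'.format(table_name.replace('"', '""'))
        conn.execute("CREATE TABLE IF NOT EXISTS {} ({})".format(quoted_table, columns))
        conn.executemany(
            "INSERT INTO {} VALUES ({})".format(quoted_table, placeholders), reader
        )
    conn.commit()
```

A call might look like load_csv_to_sqlite(sqlite3.connect("imdb.db"), "movies", "movies.csv"), where both names are placeholders to be replaced with the actual files in the collection.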
This data set was curated as part of an exercise for the LSSTC Data Science Fellowship Program (DSFP).
Some of the data are sourced from the Sloan Digital Sky Survey (SDSS; sdss.org).
These data are packaged as a "tarball" that includes two main files:
training_sources.csv – list of sources in the training_lcs/ folder, including name, classification, mean magnitude, total number of observations, and duration of observations
test_sources.csv – list of sources in the test_lcs/ folder, including name, classification, mean...
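A quick first check on a source list like these is to count how many sources fall into each class. The sketch below does that with the Python standard library. The helper name count_by_column is hypothetical, and the column name passed to it (for example "classification") is an assumption: the actual header names in the CSV files are not given in this description and should be checked first.

```python
import csv
from collections import Counter


def count_by_column(csv_path, column):
    """Count rows per distinct value of one column in a CSV file with a header row.

    Hypothetical helper; the column name must match the file's actual header.
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        return Counter(row[column] for row in reader)
```

For example, count_by_column("training_sources.csv", "classification") would return a Counter mapping each class label to its number of sources, provided the file has a column with that exact name.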
This data set includes photometric features as measured by the Sloan Digital Sky Survey (SDSS; sdss.org) that can be used as a training set in a machine learning model to separate galaxies and stars. It is used as part of a problem developed for the LSSTC Data Science Fellowship Program....
This data set was curated as part of an exercise for the LSSTC Data Science Fellowship Program (DSFP). The data are sourced from the Sloan Digital Sky Survey (SDSS; sdss.org).
The following query of the SDSS database was used to select these data (note - use of "TOP" in the...