A Iris Data Set

EasyChair Preprint 5924

12 pages • Date: June 27, 2021

Abstract

The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper on the use of multiple measurements in taxonomic problems. It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphological variation of Iris flowers of three related species. The data set consists of 50 samples from each of three species of Iris (Iris setosa, Iris virginica, and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimeters. The data set became a typical test case for many statistical classification techniques in machine learning, such as support vector machines. It contains 150 records with 5 attributes: Petal Length, Petal Width, Sepal Length, Sepal Width, and Class (Species).
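
The record layout described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not code from the preprint: the names `IrisRecord`, `describe`, and `has_expected_shape` are hypothetical, and the three sample rows are the first record of each species as the data set is commonly distributed.

```python
from collections import Counter
from dataclasses import dataclass

# Species labels used for the Class attribute (assumed spelling).
SPECIES = ("setosa", "versicolor", "virginica")


@dataclass(frozen=True)
class IrisRecord:
    """One record: four measurements in centimeters plus the class label."""
    sepal_length: float
    sepal_width: float
    petal_length: float
    petal_width: float
    species: str


def describe(records):
    """Return (number of records, per-species counts)."""
    counts = Counter(r.species for r in records)
    return len(records), dict(counts)


def has_expected_shape(records, per_species=50):
    """Check that every species contributes exactly `per_species` records."""
    total, counts = describe(records)
    return (
        total == per_species * len(SPECIES)
        and all(counts.get(s, 0) == per_species for s in SPECIES)
    )


# Illustrative subset only: one row per species, not the full 150 records.
sample = [
    IrisRecord(5.1, 3.5, 1.4, 0.2, "setosa"),
    IrisRecord(7.0, 3.2, 4.7, 1.4, "versicolor"),
    IrisRecord(6.3, 3.3, 6.0, 2.5, "virginica"),
]
```

Calling `has_expected_shape` on the full data set with the default `per_species=50` would confirm the 150-record, three-class structure stated in the abstract; on the three-row `sample` it only passes with `per_species=1`.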

Keyphrases: K-means, normalization, selection of attribute

Links:

https://easychair.org/publications/preprint/M2KG

BibTeX entry

BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:

@booklet{EasyChair:5924,
  author    = {Shiza Mushtaq},
  title     = {A Iris Data Set},
  howpublished = {EasyChair Preprint 5924},
  year      = {EasyChair, 2021}}
