Download PDFOpen PDF in browser

Towards an Information-Theoretic Approach to Population Structure

17 pagesPublished: June 22, 2012

Abstract

This paper uses an information-theoretic perspective to propose multi-locus informativeness measures for ancestry inference. These measures describe the potential for correct classification of unknown individuals to their source populations, given genetic data on population structure. Motivated by Shannon's axiomatic approach in deriving a unique information measure for communication (Shannon 1948), we first identify a set of intuitively justifiable criteria that any such quantitative information measure should satisfy, and then select measures that comply with these criteria. It is shown that standard information-theoretic measures such as multidimensional mutual information cannot completely account for informativeness when source populations differ in size, necessitating a decision-theoretic approach.

Keyphrases: classification, information theory, informativeness for assignment, population divergence

In: Andrei Voronkov (editor). Turing-100. The Alan Turing Centenary, vol 10, pages 353-369.

BibTeX entry
@inproceedings{Turing-100:Towards_Information_Theoretic_Approach,
  author    = {Omri Tal},
  title     = {Towards an Information-Theoretic Approach to Population Structure},
  booktitle = {Turing-100. The Alan Turing Centenary},
  editor    = {Andrei Voronkov},
  series    = {EPiC Series in Computing},
  volume    = {10},
  publisher = {EasyChair},
  bibsource = {EasyChair, https://easychair.org},
  issn      = {2398-7340},
  url       = {/publications/paper/8T6},
  doi       = {10.29007/268w},
  pages     = {353-369},
  year      = {2012}}
Download PDFOpen PDF in browser