Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • apa.csl
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Information Theoretic Deductions Using Machine Learning with an Application in Sociology
Karlstad University, Faculty of Health, Science and Technology (starting 2013), Department of Mathematics and Computer Science (from 2013).ORCID iD: 0000-0001-7547-8111
Karlstad University, Faculty of Health, Science and Technology (starting 2013), Department of Mathematics and Computer Science (from 2013).
Karlstad University, Faculty of Health, Science and Technology (starting 2013), Department of Mathematics and Computer Science (from 2013).ORCID iD: 0000-0003-0368-9221
University of Wollongong, Australia.
Show others and affiliations
2024 (English)In: Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods ICPRAM / [ed] Modesto Castrillon-Santana; Maria De Marsico; Ana Fred, SciTePress, 2024, Vol. 1, p. 320-328Conference paper, Published paper (Refereed)
Abstract [en]

Conditional entropy is an important concept that naturally arises in fields such as finance, sociology, and intelligent decision making when solving problems involving statistical inferences. Formally speaking, given two random variables X and Y, one is interested in the amount and direction of information flow between X and Y . It helps to draw conclusions about Y while only observing X. Conditional entropy H(Y |X) quantifies the amount of information flow from X to Y . In practice, calculating H(Y |X) exactly is infeasible. Current estimation methods are complex and suffer from estimation bias issues. In this paper, we present a simple Machine Learning based estimation method. Our method can be used to estimate H(Y |X) for discrete X and bi-valued Y. Given X and Y observations, we first construct a natural binary classification training dataset. We then train a supervised learning algorithm on this dataset, and use its prediction accuracy to estimate H(Y |X). We also present a simple condition on the prediction accuracy to determine if there is information flow from X to Y. We support our ideas using formal arguments and through an experiment involving a gender-bias study using a part of the employee database of Karlstad University, Sweden. 

Place, publisher, year, edition, pages
SciTePress, 2024. Vol. 1, p. 320-328
Keywords [en]
Conditional Entropy, Binary Classification, Information Theory, Supervised Machine Learning, Automated Data Mining, Machine Learning in Sociology
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kau:diva-99723DOI: 10.5220/0012371600003654Scopus ID: 2-s2.0-85190701985ISBN: 978-989-758-684-2 (electronic)OAI: oai:DiVA.org:kau-99723DiVA, id: diva2:1859588
Conference
13th International Conference on Pattern Recognition Applications and Methods ICPRAM, Rome, Italy, February 24-26, 2024.
Available from: 2024-05-22 Created: 2024-05-22 Last updated: 2024-05-22Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopusConference website

Authority records

Ramaswamy, ArunselvanMa, YunpengAlfredsson, StefanBrunstrom, Anna

Search in DiVA

By author/editor
Ramaswamy, ArunselvanMa, YunpengAlfredsson, StefanBrunstrom, Anna
By organisation
Department of Mathematics and Computer Science (from 2013)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 92 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • apa.csl
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf