Building and Validating Hierarchical Lexicons with a Case Study on Personal Values

Steven Wilson, Yiting Shen, Rada Mihalcea

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

Abstract

We introduce a crowd-powered approach for the creation of a lexicon for any theme given a set of seed words that cover a variety of concepts within the theme. Terms are initially sorted by automatically clustering their embeddings and subsequently rearranged by crowd workers in order to create a tree structure. This type of organization captures hierarchical relationships between concepts and allows for a tunable level of specificity when using the lexicon to collect measurements from a piece of text. We use a lexicon expansion method to increase the overall coverage of the produced resource. Using our proposed approach, we create a hierarchical lexicon of personal values and evaluate its internal and external consistency. We release this novel resource to the community as a tool for measuring value content within text corpora.
Original language: English
Title of host publication: Social Informatics
Editors: Steffen Staab, Olessia Koltsova, Dmitry I. Ignatov
Place of Publication: Cham
Publisher: Springer
Pages: 455-470
Number of pages: 16
ISBN (Electronic): 978-3-030-01129-1
ISBN (Print): 978-3-030-01128-4
Publication status: Published - 20 Sept 2018
Event: 10th International Conference on Social Informatics 2018 - Saint Petersburg, Russian Federation
Duration: 25 Sept 2018 to 28 Sept 2018
https://socinfo2018.hse.ru/

Publication series

Name: Lecture Notes in Computer Science (LNCS)
Publisher: Springer, Cham
Volume: 11185
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 10th International Conference on Social Informatics 2018
Abbreviated title: SocInfo 2018
Country/Territory: Russian Federation
City: Saint Petersburg
Period: 25/09/18 to 28/09/18

Keywords / Materials (for Non-textual outputs)

  • Lexicon induction
  • Crowd sourcing
  • Personal values
