Abstract
In this paper we present an approach for the task of author profiling. We propose a coherent grouping of features combined with appropriate preprocessing steps for each group. The groups we used were stylometric and structural, featuring among others, trigrams and counts of twitter specific characteristics. We address gender and age prediction as a classification task and personality prediction as a regression problem using Support Vector Machines and Support Vector Machine Regression respectively on documents created by joining each user’s tweets.
| Original language | Undefined/Unknown |
|---|---|
| Title of host publication | Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, Toulouse, France, September 8-11, 2015. |
| Editors | G.J.F. Jones, L. Cappellato, N. Ferro, E. San Juan |
| Publisher | CEUR-WS |
| Number of pages | 7 |
| Publication status | E-pub ahead of print - 11 Sept 2015 |
| Externally published | Yes |
| Event | CLEF 2015: Conference and Labs of the Evaluation Forum Experimental IR meets Multilinguality, Multimodality and Interaction - Toulouse, France Duration: 8 Sept 2015 → 11 Sept 2015 Conference number: 6 http://clef2015.clef-initiative.eu/cfl.php |
Publication series
| Name | CEUR Workshop Proceedings |
|---|---|
| Publisher | CEUR-WS |
| Volume | 1391 |
| ISSN (Print) | 1613-0073 |
Conference
| Conference | CLEF 2015: Conference and Labs of the Evaluation Forum Experimental IR meets Multilinguality, Multimodality and Interaction |
|---|---|
| Abbreviated title | CLEF 2015 |
| Country/Territory | France |
| City | Toulouse |
| Period | 8/09/15 → 11/09/15 |
| Internet address |
Keywords / Materials (for Non-textual outputs)
- Information retrieval systems
- Age predictions
- Classification tasks
- Personality predictions
- Pre-processing step
- Regression problem
- Structural feature
- Support vector machine regressions
- Support vector machines
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver