Edinburgh Research Explorer

ptype: probabilistic type inference

Research output: Contribution to journalArticle

Related Edinburgh Organisations

Open Access permissions

Open

Documents

  • Download as Adobe PDF

    Accepted author manuscript, 2.32 MB, PDF document

    Licence: Creative Commons: Attribution (CC-BY)

  • Download as Adobe PDF

    Final published version, 1.17 MB, PDF document

    Licence: Creative Commons: Attribution (CC-BY)

https://link.springer.com/article/10.1007/s10618-020-00680-1
Original languageEnglish
Pages (from-to)870-904
Number of pages35
JournalData Mining and Knowledge Discovery
Volume34
Issue number3
Early online date16 Mar 2020
DOIs
Publication statusPublished - 31 May 2020

Abstract

Type inference refers to the task of inferring the data type of a given column of data. Current approaches often fail when data contains missing data and anomalies, which are found commonly in real-world data sets. In this paper, we propose ptype, a probabilistic robust type inference method that allows us to detect such entries, and infer data types. We further show that the proposed method outperforms existing methods.

    Research areas

  • Type inference, Robustness, Probabilistic finite-state machine

Download statistics

No data available

ID: 142286237