Benford's law is a mathematical model, very recurrent in practice for a wide variety of datasets, used to represent the frequencies of digits. A well-established usage of Benfordness statistical testing lies within investigations aimed to ascertain if balance sheet and income statement data are genuine. A typical, frustrating problem of Benfordness statistical tests on big, practical datasets is that they often provide p-valuessmaller than expected when the Benfordness null hypothesis is very realistic. A possible reason is that data are contaminated by some kind of noise. In this paper we propose the deconvolution approach to alleviate this issue, using both simulated and real data.

Validating Benfordness on contaminated data

Di Marzio, Marco;Fensore, Stefania;Passamonti, Chiara
2024-01-01

Abstract

Benford's law is a mathematical model, very recurrent in practice for a wide variety of datasets, used to represent the frequencies of digits. A well-established usage of Benfordness statistical testing lies within investigations aimed to ascertain if balance sheet and income statement data are genuine. A typical, frustrating problem of Benfordness statistical tests on big, practical datasets is that they often provide p-valuessmaller than expected when the Benfordness null hypothesis is very realistic. A possible reason is that data are contaminated by some kind of noise. In this paper we propose the deconvolution approach to alleviate this issue, using both simulated and real data.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11564/846453
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact