International projects, online questionnaires, or data mining techniques now allow audience researchers to gather very large and complex datasets. But whilst data collection capacity is hugely growing, qualitative analysis, conversely, becomes increasingly difficult to conduct. In this paper, I suggest a strategy that might allow the researcher to manage this complexity. The World Hobbit Project dataset (36,109 cases), including answers to both closed and open-ended questions, was used for this purpose. The strategy proposed here is based on between-methods sequential triangulation, and tries to combine statistical techniques (k-means clustering) with textual analysis. K-means clustering permitted to reduce data to a small number of ideal-typical respondents: the ‘average spectator’, the ‘die-hard fan’, the ‘cultured spectator’, the ‘alternative spectator’. These clusters are the outcome of a crossvalidation process. Textual responses corresponding to each cluster, in fact, were also analyzed using a quali-qualitative approach, in order to both refine the clusters and identify meaningful discourse patterns. The methodological mix proposed can be used with confidence, since it proved to yield reliable results.

Trobia, A. (2016). Selecting significant respondents from large audience datasets: The case of the World Hobbit Project. PARTICIPATIONS, 13(2), 440-468.

Selecting significant respondents from large audience datasets: The case of the World Hobbit Project

TROBIA, Alberto
2016-01-01

Abstract

International projects, online questionnaires, or data mining techniques now allow audience researchers to gather very large and complex datasets. But whilst data collection capacity is hugely growing, qualitative analysis, conversely, becomes increasingly difficult to conduct. In this paper, I suggest a strategy that might allow the researcher to manage this complexity. The World Hobbit Project dataset (36,109 cases), including answers to both closed and open-ended questions, was used for this purpose. The strategy proposed here is based on between-methods sequential triangulation, and tries to combine statistical techniques (k-means clustering) with textual analysis. K-means clustering permitted to reduce data to a small number of ideal-typical respondents: the ‘average spectator’, the ‘die-hard fan’, the ‘cultured spectator’, the ‘alternative spectator’. These clusters are the outcome of a crossvalidation process. Textual responses corresponding to each cluster, in fact, were also analyzed using a quali-qualitative approach, in order to both refine the clusters and identify meaningful discourse patterns. The methodological mix proposed can be used with confidence, since it proved to yield reliable results.
2016
Settore SPS/07 - Sociologia Generale
Trobia, A. (2016). Selecting significant respondents from large audience datasets: The case of the World Hobbit Project. PARTICIPATIONS, 13(2), 440-468.
File in questo prodotto:
File Dimensione Formato  
Trobia (2016) The Hobbit.pdf

accesso aperto

Dimensione 1.12 MB
Formato Adobe PDF
1.12 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/217215
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact