How do the kids speak? Improving educational use of text mining with child-directed language models

Organisciak, Peter; Newman, Michele; Eby, David; Acar, Selcuk; Dumas, Denis

doi:10.1108/ILS-06-2022-0082

Research Article | January 19, 2023

Peter Organisciak

Department of Research Methods and Information Science, University of Denver, Denver, Colorado, USA

Peter Organisciak can be contacted at: Peter.Organisciak@du.edu

Michele Newman

Information School, University of Washington, Seattle, Washington, USA

David Eby

School of Information Sciences, University of Illinois at Urbana-Champaign, Champaign, Illinois, USA

Selcuk Acar

Department of Educational Psychology, University of North Texas, Denton, Texas, USA

Denis Dumas

Department of Educational Psychology, University of Georgia, Athens, Georgia, USA

Author & Article Information

Publisher: Emerald Publishing

Received: June 22 2022

Revision Received: September 30 2022

Revision Received: November 23 2022

Accepted: November 30 2022

Online ISSN: 2398-5356

Print ISSN: 2398-5348

Funding

Award Id(s): R305A200519

2022, Emerald Publishing Limited. Licensed re-use rights only.

Information and Learning Sciences (2023) 124 (1-2): 25–47.

https://doi.org/10.1108/ILS-06-2022-0082

Purpose

Most educational assessments tend to be constructed in a close-ended format, which is easier to score consistently and more affordable. However, recent work has leveraged computational text methods from the information sciences to make open-ended measurement more effective and reliable for older students. The purpose of this study is to determine whether models used by computational text mining applications need to be adapted when used with samples of elementary-aged children.

Design/methodology/approach

This study introduces domain-adapted semantic models for child-specific text analysis, to allow better elementary-aged educational assessment. A corpus compiled from a multimodal mix of spoken and written child-directed sources is presented and used to train a children’s language model, which is then evaluated against standard non-age-specific semantic models.

Findings

Child-oriented language is found to differ in vocabulary and word sense use from general English, while exhibiting lower gender and race biases. The model is evaluated in an educational application of divergent thinking measurement and shown to improve on generalized English models.

Research limitations/implications

The findings demonstrate the need for age-specific language models in the growing domain of automated divergent thinking and strongly encourage the same for other educational uses of computational text analysis by showing a measurable difference in the language of children.

Social implications

Understanding children’s language more representatively in automated educational assessment allows for fairer and more equitable testing. Furthermore, child-specific language models have fewer gender and race biases.

Originality/value

Research in computational measurement of open-ended responses has thus far used models of language trained on general English sources or domain-specific sources such as textbooks. To the best of the authors’ knowledge, this paper is the first to study age-specific language models for educational assessment. In addition, while there have been several targeted, high-quality corpora of child-created or child-directed speech, the corpus presented here is the first developed with the breadth and scale required for large-scale text modeling.
