en
                    array(2) {
  ["de"]=>
  array(13) {
    ["code"]=>
    string(2) "de"
    ["id"]=>
    string(1) "3"
    ["native_name"]=>
    string(7) "Deutsch"
    ["major"]=>
    string(1) "1"
    ["active"]=>
    int(0)
    ["default_locale"]=>
    string(5) "de_DE"
    ["encode_url"]=>
    string(1) "0"
    ["tag"]=>
    string(2) "de"
    ["missing"]=>
    int(0)
    ["translated_name"]=>
    string(6) "German"
    ["url"]=>
    string(97) "https://www.statworx.com/case-studies/ki-in-der-bildung-texte-effizienter-erstellen-und-bewerten/"
    ["country_flag_url"]=>
    string(87) "https://www.statworx.com/wp-content/plugins/sitepress-multilingual-cms/res/flags/de.png"
    ["language_code"]=>
    string(2) "de"
  }
  ["en"]=>
  array(13) {
    ["code"]=>
    string(2) "en"
    ["id"]=>
    string(1) "1"
    ["native_name"]=>
    string(7) "English"
    ["major"]=>
    string(1) "1"
    ["active"]=>
    string(1) "1"
    ["default_locale"]=>
    string(5) "en_US"
    ["encode_url"]=>
    string(1) "0"
    ["tag"]=>
    string(2) "en"
    ["missing"]=>
    int(0)
    ["translated_name"]=>
    string(7) "English"
    ["url"]=>
    string(104) "https://www.statworx.com/en/case-studies/ai-in-education-creating-and-evaluating-texts-more-efficiently/"
    ["country_flag_url"]=>
    string(87) "https://www.statworx.com/wp-content/plugins/sitepress-multilingual-cms/res/flags/en.png"
    ["language_code"]=>
    string(2) "en"
  }
}
                    
Contact
Case Studies
Case Study

AI in education: creating and evaluating texts more efficiently

In a PoC for an educational service provider, we showed that assessment and text creation processes can be automated and standardised with AI, which reduces the workload of teachers and can sustainably optimise the learning experience of students.

  • Industry Other
  • Topic Frontend Solution / GenAI
  • Tools Azure OpenAI, LangChain, Streamlit, FastAPI
  • Duration 3 months

Challenge

A leading provider of language testing and educational services recognised the need to automate the creation of test content and the correction and marking of these tests.

Currently, correcting and grading texts takes up a lot of teachers’ time – not only because of the manual effort involved, but also because teachers formulate individual suggestions for improvement for each text. Another problem is that every assessment is subjective and can vary depending on the examiner. However, students who have to prove a certain language level to an authority or educational institution have an interest in a high degree of objectivity and a quick evaluation of their tests.

There is also a second challenge: the manual creation of test content is repetitive and must be carried out at short intervals. At the same time, there is a growing need for customised text tasks that address different language levels and text forms. Taking specific EU requirements into account increases the complexity and time required for this process.

The challenge for statworx was to prove with a proof-of-concept (PoC) that these processes can be automated and standardised. This should show that both the efficiency and consistency of the assessments can be increased.

Approach

In order to overcome these challenges, statworx initiated two use cases as part of the PoC: “Author AI” and “Rater AI”. The team developed a fully functional AI backend and an intuitive user frontend to interact with it.

Use case 1: Automated text evaluation (evaluator AI)
The first use case focussed on the implementation of an evaluator AI that evaluates texts at German B1 level based on defined criteria and provides feedback for improvement. The criteria include:

  • Content appropriateness: are all key questions answered?
  • Linguistic appropriateness: What is the quality of the sentences and linguistic expression?
  • Formal correctness: Are the spelling and grammar correct?

Use case 2: Automation of text tasks (authoring AI)
The second use case is aimed at automating the creation of tasks using an authoring AI. This AI can generate texts at a specific language level and for a specific text form, depending on requirements and prompt. For example:

  • Creating a text task
  • Creating a cloze text
  • Create solution options for the text

Both concepts and implementations now serve as the foundation for further use cases.

Result

These two use cases impressively demonstrate how AI technologies can improve efficiency and quality in the education sector. The automation and standardisation of assessment and text creation processes not only reduces the workload for teachers, but also sustainably optimises the learning experience for students.

Assessor AI: increased efficiency and consistency
The implementation of the teacher AI led to a significant increase in efficiency and consistency in text assessment. Through extensive testing, the team was able to prove that the AI is more robust and consistent in its assessment than human examiners. This was confirmed by measuring the evaluation agreement using the Cohen’s Kappa test statistic.

Authoring AI: potential for automation
Authoring AI shows great potential in the automation of text tasks. The previous steps for creating tasks, cloze texts and solution options have been successfully implemented.

Expert

Contact us

Learn more!

As one of the leading companies in the field of data science, machine learning, and AI, we guide you towards a data-driven future. Learn more about statworx and our motivation.
About us