Introduction

(More details in Research)

Objectives

SCRIBE’s primary objective is to develop a Norwegian speech-to-text transcription system for multi-party conversations in realistic recording conditions.

To reach this goal, research and technology development beyond the state of the art are needed in several key areas. These include language-universal issues as well as issues specific to the Norwegian language.

Secondary Objectives

  • We will develop models that are robust to the disfluencies typical of spontaneous conversational speech, that can cope with turn-taking, and that take advantage of the context in the dialog.
  • The models will also support the use of spoken dialects and different orthographies (Bokmål, Nynorsk, or dialect-specific).
  • We will define evaluation metrics that predict the quality of the transcription based on semantics rather than merely word error rate.
  • Finally, we will contribute to the theoretical and methodological development of machine learning with sparse data.

We are grateful for funding from The Research Council of Norway within the IKTPLUSS initiative.

News

14. Mar. 2024

NordTrans press conference at NTNU on broadcast news transcription in Norwegian. Register here!

12. Mar. 2024

The SCRIBE team submitted five papers to Interspeech 2024!

15. Feb. 2024

Nasjonalbiblioteket (the National Library of Norway) launches the newest Norwegian Whisper model.

18. Dec. 2023

Giampiero Salvi attends the Workshop on Training and Evaluation Data for Italian/Multilingual LLM.

13. Dec. 2023

Giampiero Salvi presents the project at SapienzaNLP.

31. Oct. 2023

Demonstration at NorwAI Innovate. See also the Demos page.

27. Oct. 2023

Second SCRIBE Hackathon in Trondheim. We will prepare a demo for the upcoming NorwAI Innovate conference.

20. Oct. 2023

The SCRIBE team has submitted two papers to LREC-COLING 2024!

1. Sep. 2023

Heming Strømholt Bremnes is joining the team!

... see all News

Industrial Partners