FHNW Institute for Data Science Datasets

Swiss German Speech to Standard German Text

Swiss Parliaments Corpus

Dataset based on Swiss German parliament debates and their Standard German transcripts.
Parliament: Grosser Rat Kanton Bern
License: MIT

Version 1, 2020-01-28

Download
Paper

Initial version, used in GermEval 2020 Task 4, 70 hours of training data

Version 2, 2020-08-25

Download
Paper

Improved and extended version, up to 293 hours of training data