Actions
Task #3855
closedTask #3680: RA4a - Automatic error prediction
Task #3698: Experiment with one-class clasification for join cost enhancements
More data for artefacts collection
Status:
Postponed
Priority:
Normal
Assignee:
Target version:
Start date:
06.04.2016
Due date:
10.04.2016
% Done:
0%
Estimated time:
Description
We need more data for listening tests. Especially we need to increase the coverage of rare vowels. Currently we have:
| phone | total | OK | artefact | 
| a | 78 | 60 | 18 | 
| e | 82 | 46 | 36 | 
| i | 49 | 30 | 19 | 
| o | 92 | 50 | 42 | 
| u | 23 | 22 | 1 | 
| A | 123 | 17 | 104 | 
| E | 4 | 4 | 0 | 
| I | 23 | 17 | 6 | 
| O | 0 | 0 | 0 | 
| U | 4 | 4 | 0 | 
We can either try to find additional words in the corpus (shorter, though), or build "artificial" words by joining two halves of words (or words transitions) from the corpus.
Files
Actions