**TrainValTest1_csvAPartirDeTextGrid.py :** This script reads a TextGrid transcription file from the textGrid folder and retrieves the start time,end time and the corresponding text of speaker1 and speaker2 and saves them in two different files with .csv extension for each speaker. These two files will be placed in the textGridEnCSV/ folder.
**TrainValTest1_csvAPartirDeTextGrid.py :** This script reads a TextGrid transcription file from the textGrid folder and retrieves the start time,end time and the corresponding text of speaker1 and speaker2 and saves them in two different files with .csv extension for each speaker. These two files will be placed in the textGridEnCSV/ folder.
**TrainValTest2_Random.py :** This script reads each csv file from the textGridEnCSV folder, shuffles the lines where text is non-empty and splits into train/val/test.
**TrainValTest2_Random.py :** This script reads each csv file from the textGridEnCSV folder, shuffles the lines where text is non-empty and splits into train/val/test.
**TrainValTest7_Random.py :** This script groups all the words into 10 bins (each bin represents 10% of the sum of occurrences of the words in the Train), this will allow you to choose words from all the bins for the test phase (there will be bins where the words are the least repeated in the corpus and bins where the words are the most repeated in the corpus).
**TrainValTest7_Random.py :** This script groups all the words into 10 bins (each bin represents 10% of the sum of occurrences of the words in the Train), this will allow you to choose words from all the bins for the test phase (there will be bins where the words are the least repeated in the corpus and bins where the words are the most repeated in the corpus).
```bash
```bash
python3 TrainValTest7_Random.py
python3 TrainValTest7_Random.py
```
```
### Graphics
### • Graphics
**TrainValTest8_Random.py :** Bar chart to display the number of words per Bin.
**TrainValTest8_Random.py :** Bar chart to display the number of words per Bin.