Right, I tried my first whisper translation. I got a lot of repeating lines, but not just a single line repeating (though there were some of those as well) but groups of lines. For example:
123 00:08:16,000 --> 00:08:18,000 Is it true?
124 00:08:18,000 --> 00:08:22,000 No, it's not.
125 00:08:22,000 --> 00:08:25,000 I'm still in the hospital.
126 00:08:25,000 --> 00:08:28,000 I'm still in the hospital.
127 00:08:28,000 --> 00:08:30,000 Is it true?
128 00:08:30,000 --> 00:08:32,000 No, it's not.
Does anyone have an idea what settings I should change to avoid these?
I am running it on my own computer instead of online, because the online thing kept giving me errors. This means I have to use the small model, I hope that's not what is causing this issue.
Set "condition_on_previous_text" or whatever it is to False. In theory, it is there to make it learn into consistent style and slightly bias the results towards specific topics that the entire source is about. In practice specifically for porn films of languages other than English, there's going to be a lot of scenes where it can't really make any good guess, which leads to the slight bias on previous text pushing it towards the exact same "answer", and then that bias for the previous bias layers with the bias from two lines ago, and so on, leading it to land in long ruts of the same sentence it it gets a nice clear answer for it to change its mind.
The Small model definitely contributes to this, but the condition_on_previous_text is probably the larger influence.
The Small model will be enough to get the general gist of plot scenes, but will do much worse on sex dialogue and will need quite a bit more editing and interpretations to get into a good spot. If you are limited, it can be good practice to run the same film through several different settings, making sure to save the outputs to different names, and reading them side by side and copying the best into a new file.