Search results

  1. Novus.Toto

    Whisper and its many forms

    Here is a modified version of porgate55555's translation by chunks script. It provides a crude work-around for the untranslated lines issue by re-translating missed lines one-by-one. The one-by-one translation can take quite a while. pip install openai python-dotenv tqdm Add your API key to...
  2. Novus.Toto

    Whisper and its many forms

    I have modified the chunks version of your script so it translates any untranslated lines, one-by-one, after translation by chunks. This is obviously not a real solution, but just a non-ideal workaround. It can take quite a while to do those line-by-line translations. The modified script also...
  3. Novus.Toto

    Whisper and its many forms

    I’ve tried the chunks version of your script, just once so far, and also got untranslated lines and sections. Not sure if it’ll help you solve the issue but I noticed that most of the untranslated parts in my test come immediately after an extremely long line (like a hallucination line with...
  4. Novus.Toto

    Whisper and its many forms

    I queued four files to translate with VSM yesterday. Two failed and the other two were much slower than previous times. Maybe there is or was an issue at DeepSeek rather than at our end? And it's great you've written your scripts. I'm going to try them. I was planning to do the same myself but...
  5. Novus.Toto

    Whisper and its many forms

    Ah. Didn't notice that. Thanks.
  6. Novus.Toto

    Whisper and its many forms

    Hmm. This sounds interesting. I'd like to have a play. Can I assume you redirected to the DeepSeek end point by changing the ChatGPT Auto-translate settings in SubtitleEdit? But how did you get the DeepSeek model as an option available to select?
  7. Novus.Toto

    Whisper and its many forms

    Averages about 1.5 cents per video hour using the API. DDB-271, that I processed yesterday, was very speech-dense and cost 2.5 cents per video hour. Tokens are currently are an introductory price. Cost is going to increase in early Feb.
  8. Novus.Toto

    Whisper and its many forms

    I did some experimenting this today. I generated subtitles for DDB-271 (the shorter, hard-subtitled version available for download in several places) using several different workflows. I chose this video as it's quite short (01:12:29), it's got a lot of explicit language, there are...
  9. Novus.Toto

    Whisper and its many forms

    I just tried the Mistral API and get a 404 error when trying to connect. Probably I've set up the API incorrectly. I'll have another try later. And maybe try one of their models locally.
  10. Novus.Toto

    Whisper and its many forms

    You mean loosing the explicit language with Whisper? It's true Whisper is very averse to vulgarity but, even so, I think I preferred end-to-end Whisper to DeepL. Though I only tried DeepL a couple of times, got incoherent output, and gave up on it. Maybe I should have persisted.
  11. Novus.Toto

    Whisper and its many forms

    I have once or twice. Like Artemis I think end-to-end Whisper does a better job. And DeepL is much inferior to DeepSeek IMO.
  12. Novus.Toto

    Whisper and its many forms

    This is what I do to use DeepSeek. I expect it’s a far from optimal workflow. 1. Extract the audio Using VLC No additional audio processing is done 2. Transcribe the audio using WhisperWithVAD_pro Using default settings 3. Install Video Subtitle Master (because I’m lazy I’ll call it VSM)...
  13. Novus.Toto

    Whisper and its many forms

    Hi Artemis I'm going fishing soon but will provide more details on the method I use - probably tomorrow.
  14. Novus.Toto

    Whisper and its many forms

    I'm not sure if this is allowed in this thread as it's not entirely Whisper related. Mods, please remove if it's inappropriate. I've been using WhisperJAV and WhisperWithVAD_pro to generate subtitles that (no shade on mei2 or the Whisper model creators intended) have translations that range in...