Search results

akiba resident JAV subtitlers & subtitle talk★NOT A SUB REQUEST THREAD★

I've noticed the way you split lines makes a difference with DeepL. Translating each line separately removes context, but running them together makes it blend the lines and repeat itself. See avatarthe's tests: The best fix I've found is putting quotes between the lines. 「」 and "" work about...
- Non_Entity
- Post #1,193
- Oct 8, 2022
- Forum: JAV Discussion
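
A rough sketch of the quote trick described in the post above, for anyone who wants to try it. Everything here is an assumption rather than the poster's actual code: the helper names (`wrap_lines`, `unwrap_lines`, `translate_batch`) are made up for illustration, each line is wrapped in 「」 and joined with newlines, and `translate` stands in for whatever function sends text to DeepL and returns the result.

```python
from typing import Callable, List

# Quote characters that may surround a line after translation (assumed set;
# a translator may turn 「」 into straight or curly double quotes).
QUOTE_CHARS = "「」\"“”"


def wrap_lines(lines: List[str], open_q: str = "「", close_q: str = "」") -> str:
    """Wrap every subtitle line in quotes and join them into one text block."""
    return "\n".join(f"{open_q}{line}{close_q}" for line in lines)


def unwrap_lines(text: str) -> List[str]:
    """Split a translated block back into lines and strip the outer quotes."""
    return [
        part.strip().strip(QUOTE_CHARS).strip()
        for part in text.splitlines()
        if part.strip()
    ]


def translate_batch(lines: List[str], translate: Callable[[str], str]) -> List[str]:
    """Translate all lines in one request so the translator sees context.

    `translate` is a hypothetical callable (text in, translated text out).
    If the translator merges or drops lines, fall back to one request per line.
    """
    translated = unwrap_lines(translate(wrap_lines(lines)))
    if len(translated) != len(lines):
        translated = [translate(line) for line in lines]
    return translated
```

Note that `unwrap_lines` also strips quote marks that were part of the original dialogue at the start or end of a line, so this is only suitable as a starting point.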
akiba resident JAV subtitlers & subtitle talk★NOT A SUB REQUEST THREAD★

The 413 error should be fixed now. It'll also save the original Japanese output in a separate file when using DeepL (not possible when Whisper translates it). Vosk is based on Kaldi. It uses an acoustic model that looks at each fraction of a second, and predicts individual phonemes. Then a...
- Non_Entity
- Post #1,184
- Oct 2, 2022
- Forum: JAV Discussion
akiba resident JAV subtitlers & subtitle talk★NOT A SUB REQUEST THREAD★

It's how many seconds the VAD waits before splitting the audio. Each chunk goes through Whisper separately, which prevents it from getting stuck on one line for the whole video.
- Non_Entity
- Post #1,177
- Sep 29, 2022
- Forum: JAV Discussion
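
To illustrate the chunking idea in the post above: a minimal sketch, assuming the VAD has already produced speech segments as (start, end) times in seconds. The function name `group_segments` and the 3.0 default are hypothetical and not taken from the notebook, whose actual logic may differ. A new chunk starts whenever the silence between two consecutive segments is longer than `chunk_threshold`.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start_seconds, end_seconds)


def group_segments(
    segments: List[Segment], chunk_threshold: float = 3.0
) -> List[List[Segment]]:
    """Group speech segments into chunks separated by long silences.

    Consecutive segments stay in the same chunk while the gap between them
    is at most `chunk_threshold` seconds; a longer gap starts a new chunk.
    """
    chunks: List[List[Segment]] = []
    for start, end in sorted(segments):
        if chunks and start - chunks[-1][-1][1] <= chunk_threshold:
            chunks[-1].append((start, end))
        else:
            chunks.append([(start, end)])
    return chunks
```

With this grouping, a lower threshold yields more, shorter chunks, and each chunk would then be transcribed on its own.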
akiba resident JAV subtitlers & subtitle talk★NOT A SUB REQUEST THREAD★

I've updated the notebook. It should be much less likely to output things like "Please subscribe" now. For the rest of the video, or just one scene? You could try lowering the chunk_threshold to 2.0 or 1.0 if that happens.
- Non_Entity
- Post #1,175
- Sep 29, 2022
- Forum: JAV Discussion
akiba resident JAV subtitlers & subtitle talk★NOT A SUB REQUEST THREAD★

I've written a notebook that combines Whisper with a separate VAD. It works much better than Whisper alone on long-form inputs, and also runs about 2-4x faster. https://colab.research.google.com/github/ANonEntity/WhisperWithVAD/blob/main/WhisperWithVAD.ipynb It's still far from perfect, though...
- Non_Entity
- Post #1,171
- Sep 27, 2022
- Forum: JAV Discussion
akiba resident JAV subtitlers & subtitle talk★NOT A SUB REQUEST THREAD★

A big hurdle to fine-tuning Whisper (or any other model) is the lack of Japanese training data. OpenAI's dataset has 15,914 hours of JP audio (7,054 hours with Japanese transcripts, 8,860 with English ones), and even that dwarfs the publicly available ones I'm aware of. Is anyone interested in...
- Non_Entity
- Post #1,165
- Sep 26, 2022
- Forum: JAV Discussion
