You need to transcribe it first from audio-to-text You can try py-transcribe. In Mac, there's an app called Libretto, which I think is a front-end of py-transcribe. In my experience both have bad results with inaccurate time stamps.But that is to translate text-to-text. If you want to translate audio to subtitles?
if you have experience with python, py-transcribe, scikit-learn, keras and tensorflow:You need to transcribe it first from audio-to-text You can try py-transcribe. In Mac, there's an app called Libretto, which I think is a front-end of py-transcribe. In my experience both have bad results with inaccurate time stamps.
With actors talking in whispers and often talking while other is talking/moaning, it's a difficult job for the transcription software. There's also the background music.
There are online transcription services, but they're costly and usually does not make .srt file.
Only if my data scientist friend was into JAV, he would have done something similar. Sadly he is not he has mad alot of programs for himself though to maintain and organize his JAV database and media.if you have experience with python, py-transcribe, scikit-learn, keras and tensorflow:
- use an audio to text transcriber in order to get input data
- when you’ve built a collection of data, instead of trying to make subtitles, set up like an excel spreadsheet and tag “hits” and “misses” (if the transcription is dialogue, it’s a hit, if it came from a moan or anything else, it’s a miss)
- get a lot of this kind of data and tag it appropriately
- when you have this data, use ML to build a binary classifier or perform logistic regression on the data. a standard neural network with a few layers is probably fine. the key being you want to train input data so the model will recognize when transcribed audio is anything of worth or not. like, it’ll probably spit out a percentage that you can use.
- after training the model, you can then set up a function that’ll take audio transcription data, perform a prediction using your model, and then pass or fail it based on whatever tolerance you set (i.e. 80% chance it’s dialogue) and if it passes, output the transcription to a translator, and then log the translated text
of course it’s probably more work than worth and definitely easier said than done. this is an extremely high level explanation of the process to machine learning and theres a ton i glossed over.
haha I didnt know thta was possible to upload just audio, A long time back when I got desperate to find subtitle for a particular ID, I thought why not try YouTube just for translation, So I tried to upload the JAV to youtube set to private so I can auto translate and watch it on the site(I thought I had the perfect plan), apparently youtube recognizes adult content and deletes it automatically after upload LOL.You could probably also upload the audio track to yt and download the auto captions. Unlike for en the results aren't amazing for ja though.
1. AutoGenSubGui (portable version of autosub with gui, just download, unpack and use)
Link1 , Link2
For better results use: from Japanese to Japanese.
2. deepl.com (use Translate Files)
AutoGenSubGui generates srt file. Copy content from srt file to Word docx. Translate whole docx file via deepl. Open translated file and safe again in doc format to bypass restrict editing Detailed instruction. Create new srt file and copy content from doc file. DONE
If video have good audible speech , it is possible to get pretty good results.
1. AutoGenSubGui (portable version of autosub with gui, just download, unpack and use)
Link1 , Link2
For better results use: from Japanese to Japanese.
2. deepl.com (use Translate Files)
AutoGenSubGui generates srt file. Copy content from srt file to Word docx. Translate whole docx file via deepl. Open translated file and safe again in doc format to bypass restrict editing Detailed instruction. Create new srt file and copy content from doc file. DONE
If video have good audible speech , it is possible to get pretty good results.
Do you have to use a video file or can you just use the audio? That should help you avoid the problem. The other option is to download a lower resolution version of the video. After all, you don't need high video resolution to make the subs, it's all about the audio.I have tryed AutoGenSubGui and it works well with trailers of 1 minute, but it says "memory error" when I try to translate a movie (5 GB). I have a modern PC, with 32 GB of RAM and a lot of HD space.
EDIT: Looking at the log, the error appears when it tries to extract the audio... I have tryed with two PCs and different videos, with no luck...
EDIT 2: It works well with files around 2 GB, but with bigger files it says "memory error". 99% of modern movies are bigger than 2 GB...
Good advise for allThere are many ways, utilizing the various translation services on the internet. This only what I know:
1. Write a small program, for example see this: https://towardsdatascience.com/language-translation-using-python-bd8020772ccc
2. Use services such as DeepL (https://www.deepl.com/) or subtitlecat (https://www.subtitlecat.com/)
2a. DeepL does not accept subtitle format such as srt, so you need to convert it to docx (change the extension to .txt, open in MS Word and Save as a Word document). The result from DeepL also needs to be copied back to an srt file (if you just convert it to text and add the extension .srt, it won't open). DeepL limit how many file you can translate each day/month unless you go pro.
2b. Subtitlecat, upload the original srt file, and then you can select the target language. There's also limitation unless you subscribe.
3. Use free translation such as https://translate.google.com/ https://www.bing.com/Translator/ https://translate.yandex.com/ https://papago.naver.com/
There's a character limit, usually 5,000 characters, that you can do in 1 go. If your translating from an .srt file, limit your translation to 50 dialog lines (for languages using alphabets) or 100 lines (languages using kanji). Open the .srt file using a text editor (I'm using BBEdit Free, since I'm on a Mac) and copy paste, including the time stamps (remember the character limit). Then copy back the results.
There are also applications that act as front end of the service. Like this
View attachment 2923159
I normally use this method now.
I hope this information helps.