Post your JAV subtitle files here - JAV Subtitle Repository (JSP)★NOT A SUB REQUEST THREAD★

SamKook · Feb 21, 2023

Imscully said:
Whenever I use the Subtitle Edit program I always like to convert to Subrip type file once the auto translate function is finished. Not even sure how I landed on that as a fav. Perhaps I'm missing out, but it seems to streamline the subs while maintaining the ability to edit properly. I'd be curious to know if there is a reason you prefer ASS (as opposed to Subrip or ASSA).

ASSA is the same thing as .ass btw.

Unless you want to do something more than just basic text on a video, there's really no good reason to use .ass over subrip(.srt) which is likely the most supported subtitle format.

.ass just gives you a lot of control over how the text is displayed. You can change the font of individual lines, color them, move them to a specific position, scale them bigger or smaller, add gradients to the text, fade effects, stuff like that. You can even draw shapes if you want to block some text on screen and add subtitles over that part.

Here's an example of something fancy I did back in the day, the "B&L's Style Romance!" text in that picture is 100% soft sub, you can toggle it on and off. It's 9 different subtitle lines(~~and possibly 2 more lines for the gradient and something else, forgot how that works~~ 7 of which are to create the gradient effect, it only shows a slice of each line of different colors) stacked onto each other to mimic the style of the japanese title.

Edit: One thing I forgot about that could be useful with .ass even if you only want basic text is that you can set an actor name for each line which doesn't get displayed into the subs so if you find it helpful to know who is saying what when editing, that's one thing .srt can't do.

ironfevers · Feb 21, 2023

mei2 said:
A bit of off topic question: does anyone know how I can brun 2 subtitlles into a video (one on top and one on bottom of the screen)? Thanks.

Merge two subtitle files in different languages into one

This online tool will combine two subtitle files into one. You will see the dialogues in two languages at the same time.

easypronunciation.com

ironfevers · Feb 21, 2023

ericf said:
Just a tip on Whisper about translation: Don't use it. Just choose No Translation and do that later in Google translate or DeepL translate

I have experimented with VAD-Threshold and have settled on 0.3. What does Chunk_Threshold (3.0) do? Size of the audio parts analyzed? I'll try to lower and raise the number to see if there are improvements. Source separation didn't work for me. I get error messages.

I also arrived at the same conclusions. After extensive testing, I've settled on 3.2 chunk_threshold and 0.3 vad_threshold for a good balance between catching short sentences and natural sounding speech. Source separation also fails to work. The spleeter package is to blame.

ericf said:
I have increased volume with all three types of audio to text and if you don't go overboard with the volume increase it has always yielded better results. Amplification is also better than compression. Some software doesn't like clipping so you may have to think about that, too.

I also use amplification. I amplify 1 db in Audacity and export as wav.

I have Whisper with VAD installed locally for long duration videos such as HUNTB, but I use the medium model with an RTX 3070 as large requires more VRAM. I also tested max_retries. One file gave me 256 chunks. 52 chunks (20%) needed a retry. Of those 52 chunks, 45 (87%) still failed with a max_attempts of 24. The rest succeeded within 3 retries. Conclusion, stick with a max_attempts value of 1, 2 or 3. I stick with no translation and just translate using DeepL. No one talked about the garbage_list. It'll save you so much time from cleaning up the subtitle. I added over 1000 Japanese words to the garbage list. https://drive.google.com/file/d/1AwJ-VdxA4jM6yJS37Tdxrbj1uAgaU08j/view?usp=share_link. Anything that sounds like laughing, screaming, moaning, etc. get deleted from the subtitle. The garbage list is not complete, there is always more to add. Also I changed the out_path filename so that it includes the settings used. No need to upload to the sidebar in Colab. If mounting your google drive, an example audio_path is "/content/drive/My Drive/temp/abc-123-1db.wav".

Chuckie100 · Feb 21, 2023

RCTD-505 If You Can Endure Ejaculation With Your Mother's Doskebe Fellatio,
A Prize Of 1 Million Yen If You Explode Incest Punishment Game 2

I used Whisper to produce this subtitle file for RCTD-505. This is the latest mother-son JAV from Rocket. As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. Again, I don't understand Japanese or Chinese so my re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

r00g · Feb 21, 2023

@ironfevers - thanks for the Whisper-VAD.py file! I started to do the same thing, but went a different direction after some thrashing around w/ Python. Will give this a shot soon.

Imscully · Feb 21, 2023

SamKook said:
ASSA is the same thing as .ass btw.

Unless you want to do something more than just basic text on a video, there's really no good reason to use .ass over subrip(.srt) which is likely the most supported subtitle format.

.ass just gives you a lot of control over how the text is displayed. You can change the font of individual lines, color them, move them to a specific position, scale them bigger or smaller, add gradients to the text, fade effects, stuff like that. You can even draw shapes if you want to block some text on screen and add subtitles over that part.

Here's an example of something fancy I did back in the day, the "B&L's Style Romance!" text in that picture is 100% soft sub, you can toggle it on and off. It's 9 different subtitle lines(~~and possibly 2 more lines for the gradient and something else, forgot how that works~~ 7 of which are to create the gradient effect, it only shows a slice of each line of different colors) stacked onto each other to mimic the style of the japanese title.
View attachment 3164793

Edit: One thing I forgot about that could be useful with .ass even if you only want basic text is that you can set an actor name for each line which doesn't get displayed into the subs so if you find it helpful to know who is saying what when editing, that's one thing .srt can't do.

Thanks for the great, comprehensive response.
Much appreciated.
I owe you.

JAVinsight · Feb 22, 2023

OKSN-284 - Un-edited Subtitle.

Chuckie100 · Feb 23, 2023

DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil

I used Whisper to produce this subtitle file for DTKM-30. This is another mother-son swap JAV, staring two hot MILFs and where the son gets busted at the end! I had to keep updating the names as Whisper was not consistent. Still not sure if I got them totally correct! As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. Again, I don't understand Japanese or Chinese so my re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

Chuckie100 · Feb 23, 2023

JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura

I used Whisper to produce this subtitle file for JUY-886 starring Eriko Miura. As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. And Yea, I twisted it to be about a mother and her son! (You can untwist it be replacing "...father" with "...husband" and appropriate pronoun.) Again, I don't understand Japanese or Chinese so my other re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

Imscully · Feb 23, 2023

Chuckie100 said:
JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura

I used Whisper to produce this subtitle file for JUY-886 starring Eriko Miura. As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. And Yea, I twisted it to be about a mother and her son! (You can untwist it be replacing "...father" with "...husband" and appropriate pronoun.) Again, I don't understand Japanese or Chinese so my other re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

Thanks, Chuckie!!!

Safadinho · Feb 24, 2023

Chuckie100 said:
DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil

I used Whisper to produce this subtitle file for DTKM-30. This is another mother-son swap JAV, staring two hot MILFs and where the son gets busted at the end! I had to keep updating the names as Whisper was not consistent. Still not sure if I got them totally correct! As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. Again, I don't understand Japanese or Chinese so my re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

Thank you very much for this. This looks really good.

counter_productive · Feb 25, 2023

SamKook said:
I purposefully included the address bar in the screenshot from the post I made above yours so people would know, but here it is anyway, it's the WhisperWithVAD link people keep posting here: https://colab.research.google.com/github/ANonEntity/WhisperWithVAD/blob/main/WhisperWithVAD.ipynb

And yeah, seems to be that way looking at the github page for the cpp port someone made, it probably doesn't include the internal ffmpeg processing yet that the python version has:

Code:

Note that the main example currently runs only with 16-bit WAV files, so make sure to convert your input before running the tool. For example, you can use ffmpeg like this: ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le output.wav

You should also make it mono(-ac 1) unlike your previous example or else you're getting a file that's double the size for nothing.

I know this is a little old, but I've been running whisper on an mac M1. I experimented around with the different models and found the medium model to perform slightly better. Though the best translation I've found was from a fine-tuned model on hugging face that was specific for japanese. I'm pretty impressed with combining whisper (transcription) and deepl (translation).

counter_productive · Feb 25, 2023

ironfevers said:
I also arrived at the same conclusions. After extensive testing, I've settled on 3.2 chunk_threshold and 0.3 vad_threshold for a good balance between catching short sentences and natural sounding speech. Source separation also fails to work. The spleeter package is to blame.

I also use amplification. I amplify 1 db in Audacity and export as wav.

I have Whisper with VAD installed locally for long duration videos such as HUNTB, but I use the medium model with an RTX 3070 as large requires more VRAM. I also tested max_retries. One file gave me 256 chunks. 52 chunks (20%) needed a retry. Of those 52 chunks, 45 (87%) still failed with a max_attempts of 24. The rest succeeded within 3 retries. Conclusion, stick with a max_attempts value of 1, 2 or 3. I stick with no translation and just translate using DeepL. No one talked about the garbage_list. It'll save you so much time from cleaning up the subtitle. I added over 1000 Japanese words to the garbage list. https://drive.google.com/file/d/1AwJ-VdxA4jM6yJS37Tdxrbj1uAgaU08j/view?usp=share_link. Anything that sounds like laughing, screaming, moaning, etc. get deleted from the subtitle. The garbage list is not complete, there is always more to add. Also I changed the out_path filename so that it includes the settings used. No need to upload to the sidebar in Colab. If mounting your google drive, an example audio_path is "/content/drive/My Drive/temp/abc-123-1db.wav".

View attachment 3164884
View attachment 3164885

So I do something similar I have one list that are just strings to ignore. A lot of 'door closing', 'ghost screaming' or various sounds. I also have a two column file that i use for minor replacements, but yeh the file keeps getting larger and larger. I'll do an initial translation, and look at all the lines sorted in a text file and then copy the bad lines over to my text files i use to filter. I've noticed a lot of my bad lines have ()s in them so makes it a little easier. I use the medium model as well, not because of speed, but just because I noticed it tends to be more accurate. After processing with deepl I'll have 3 SRT files, one english, one japanese and one i leave as dual.

TmpGuy · Feb 25, 2023

Here's another update of my "all subtitles ever posted to this thread" collection.

Since then, there have been 722 subtitles posted.

Version 4 is now up to 33,066 subtitles. All files have been renamed, language IDs added, checked for issues, and sorted alphabetically by ID. Enjoy!

FileJoker:

FileJoker

Filejoker.net - Free file upload, download service.

filejoker.net

Rapidgator:

Download file Subtitles.v4.zip

Download Subtitles.v4.zip fast and secure

rapidgator.net

Quick plug: JavLuv 1.1.22 was used to sort these subtitles, and can also match your movie collection to available subtitles as well. It's completely free and open source, and I support it here in these forums.

ericf · Feb 25, 2023

I've tried both stereo and mono and from what I can tell, I always get better results starting with a stereo file. It makes no difference if Whisper converts that stereo track into a mono track. My tests point to better results if the original track is in stereo. And for me there's pretty much no difference between two passes on the same file. I might get a couple more commas or full stops and one or two words in a few sentences will be different, but on the whole, I get the same content. If I change the Threshold, though, there are always clear differences. So, for me, I'll stick with my stereo uploads.

SamKook · Feb 25, 2023

ericf said:
I've tried both stereo and mono and from what I can tell, I always get better results starting with a stereo file. It makes no difference if Whisper converts that stereo track into a mono track. My tests point to better results if the original track is in stereo. And for me there's pretty much no difference between two passes on the same file. I might get a couple more commas or full stops and one or two words in a few sentences will be different, but on the whole, I get the same content. If I change the Threshold, though, there are always clear differences. So, for me, I'll stick with my stereo uploads.

Have you tried making a mono file in the same way whisper does, with ffmpeg, to test it? Maybe whatever you're using is not using the same process and creating a different result or if you skip also making it 16k and leaving it at the original, it influence things.

Just curious because it makes no sense that it would be any different unless the resulting audio is altered in some way.

Edit: With the above, I assumed you used wav when making them mono but if that's not the case, then that's likely the problem right there, you're doing an extra lossy encoding pass so you're losing details more than if you didn't do anything.

lock_on · Feb 26, 2023

TmpGuy said:
Here's another update of my "all subtitles ever posted to this thread" collection.

Since then, there have been 722 subtitles posted.
View attachment 3168627

Version 4 is now up to 33,066 subtitles. All files have been renamed, language IDs added, checked for issues, and sorted alphabetically by ID. Enjoy!

Thanks, but do you have the updated version only? Because it takes time to download all over again only for duplicates.

Prinsipe · Feb 26, 2023

lock_on said:
Thanks, but do you have the updated version only? Because it takes time to download all over again only for duplicates.

Yes i agree. It is also better to create a folder for new or additional subtitle only like what others are doing to avoid file duplication if we already had the other files before.

wwilk · Feb 26, 2023

Thank you for doing this. I wish more were in English but oh well.

TmpGuy said:
Here's another update of my "all subtitles ever posted to this thread" collection.

Since then, there have been 722 subtitles posted.
View attachment 3168627

Version 4 is now up to 33,066 subtitles. All files have been renamed, language IDs added, checked for issues, and sorted alphabetically by ID. Enjoy!

FileJoker:

FileJoker

Filejoker.net - Free file upload, download service.

filejoker.net

Rapidgator:

Download file Subtitles.v4.zip

Download Subtitles.v4.zip fast and secure

rapidgator.net

Quick plug: JavLuv 1.1.22 was used to sort these subtitles, and can also match your movie collection to available subtitles as well. It's completely free and open source, and I support it here in these forums.

lnstar · Feb 26, 2023

TmpGuy said:
Here's another update of my "all subtitles ever posted to this thread" collection.

Since then, there have been 722 subtitles posted.
View attachment 3168627

Version 4 is now up to 33,066 subtitles. All files have been renamed, language IDs added, checked for issues, and sorted alphabetically by ID. Enjoy!

FileJoker:

FileJoker

Filejoker.net - Free file upload, download service.

filejoker.net

Rapidgator:

Download file Subtitles.v4.zip

Download Subtitles.v4.zip fast and secure

rapidgator.net

Quick plug: JavLuv 1.1.22 was used to sort these subtitles, and can also match your movie collection to available subtitles as well. It's completely free and open source, and I support it here in these forums.

Thanks for the subs! For those of us who don't subscribe to filejoker, are the zip files available on other sites such as https://mega.nz , which don't requre a subscription? Thanks

Post your JAV subtitle files here - JAV Subtitle Repository (JSP)★NOT A SUB REQUEST THREAD★

Grand Wizard

Active Member

Active Member

Attachments

Well-Known Member

Attachments

Member

Well-Known Member

Well-Known Member

Attachments

Well-Known Member

DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil​

Attachments

Well-Known Member

JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura​

Attachments

Well-Known Member

JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura​

New Member

DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil​

Member

Member

JavLuv author, lesbian connoisseur

Well-Known Member

Grand Wizard

Member

Member

Active Member

New Member

DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil

JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura

JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura

DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil