Post your JAV subtitle files here - JAV Subtitle Repository (JSP)★NOT A SUB REQUEST THREAD★

Makkdom

Well-Known Member
Mar 4, 2019
157
388

APNS-305 As long as it's for the sake of the person I love, I don't mind being circulated, defiled, or seeded by people I don't know... Jun Suehiro​


Another file I found and worked on for a while. A young girl agrees to help her boyfriend pay off some debts and spirals into a pretty degrading life.


apns00305pl.jpg
 

Attachments

  • APNS-305-en.zip
    15 KB · Views: 316
Last edited:

SamKook

Grand Wizard
Staff member
Super Moderator
Uploader
May 10, 2009
3,755
5,154
Whenever I use the Subtitle Edit program I always like to convert to Subrip type file once the auto translate function is finished. Not even sure how I landed on that as a fav. Perhaps I'm missing out, but it seems to streamline the subs while maintaining the ability to edit properly. I'd be curious to know if there is a reason you prefer ASS (as opposed to Subrip or ASSA).

ASSA is the same thing as .ass btw.

Unless you want to do something more than just basic text on a video, there's really no good reason to use .ass over subrip(.srt) which is likely the most supported subtitle format.

.ass just gives you a lot of control over how the text is displayed. You can change the font of individual lines, color them, move them to a specific position, scale them bigger or smaller, add gradients to the text, fade effects, stuff like that. You can even draw shapes if you want to block some text on screen and add subtitles over that part.

Here's an example of something fancy I did back in the day, the "B&L's Style Romance!" text in that picture is 100% soft sub, you can toggle it on and off. It's 9 different subtitle lines(and possibly 2 more lines for the gradient and something else, forgot how that works 7 of which are to create the gradient effect, it only shows a slice of each line of different colors) stacked onto each other to mimic the style of the japanese title.
MM08_title.jpg

Edit: One thing I forgot about that could be useful with .ass even if you only want basic text is that you can set an actor name for each line which doesn't get displayed into the subs so if you find it helpful to know who is saying what when editing, that's one thing .srt can't do.
 
Last edited:

ironfevers

Active Member
Nov 9, 2019
37
157
Just a tip on Whisper about translation: Don't use it. Just choose No Translation and do that later in Google translate or DeepL translate

I have experimented with VAD-Threshold and have settled on 0.3. What does Chunk_Threshold (3.0) do? Size of the audio parts analyzed? I'll try to lower and raise the number to see if there are improvements. Source separation didn't work for me. I get error messages.
I also arrived at the same conclusions. After extensive testing, I've settled on 3.2 chunk_threshold and 0.3 vad_threshold for a good balance between catching short sentences and natural sounding speech. Source separation also fails to work. The spleeter package is to blame.
I have increased volume with all three types of audio to text and if you don't go overboard with the volume increase it has always yielded better results. Amplification is also better than compression. Some software doesn't like clipping so you may have to think about that, too.
I also use amplification. I amplify 1 db in Audacity and export as wav.


I have Whisper with VAD installed locally for long duration videos such as HUNTB, but I use the medium model with an RTX 3070 as large requires more VRAM. I also tested max_retries. One file gave me 256 chunks. 52 chunks (20%) needed a retry. Of those 52 chunks, 45 (87%) still failed with a max_attempts of 24. The rest succeeded within 3 retries. Conclusion, stick with a max_attempts value of 1, 2 or 3. I stick with no translation and just translate using DeepL. No one talked about the garbage_list. It'll save you so much time from cleaning up the subtitle. I added over 1000 Japanese words to the garbage list. https://drive.google.com/file/d/1AwJ-VdxA4jM6yJS37Tdxrbj1uAgaU08j/view?usp=share_link. Anything that sounds like laughing, screaming, moaning, etc. get deleted from the subtitle. The garbage list is not complete, there is always more to add. Also I changed the out_path filename so that it includes the settings used. No need to upload to the sidebar in Colab. If mounting your google drive, an example audio_path is "/content/drive/My Drive/temp/abc-123-1db.wav".

aldn034pl.jpg
gvh454pl.jpg
 

Attachments

  • gvh-472-large.whisper.ja.zip
    27.1 KB · Views: 316
  • huntb-241-medium.whisper.ja.zip
    107.7 KB · Views: 284
  • huntb-222-medium.whisper.ja.zip
    55.3 KB · Views: 299
  • gvh-454-large.whisper.ja.zip
    51.8 KB · Views: 272
  • aldn-034-medium.whisper.ja.zip
    78 KB · Views: 295
  • gvh472pl.jpg
    gvh472pl.jpg
    179.9 KB · Views: 229
  • huntb241pl.jpg
    huntb241pl.jpg
    209.2 KB · Views: 231
  • huntb222pl.jpg
    huntb222pl.jpg
    213.7 KB · Views: 227

Chuckie100

Well-Known Member
Sep 13, 2019
721
2,818
RCTD-505 If You Can Endure Ejaculation With Your Mother's Doskebe Fellatio,
A Prize Of 1 Million Yen If You Explode Incest Punishment Game 2
342742_3xplanet_RCTD-505_cover.jpg

I used Whisper to produce this subtitle file for RCTD-505. This is the latest mother-son JAV from Rocket. As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. Again, I don't understand Japanese or Chinese so my re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.​

 

Attachments

  • RCTD-505 - Copy.rar
    23 KB · Views: 359

r00g

Member
Jul 25, 2009
84
70
@ironfevers - thanks for the Whisper-VAD.py file! I started to do the same thing, but went a different direction after some thrashing around w/ Python. Will give this a shot soon.
 

Imscully

Well-Known Member
Apr 1, 2014
361
640
ASSA is the same thing as .ass btw.

Unless you want to do something more than just basic text on a video, there's really no good reason to use .ass over subrip(.srt) which is likely the most supported subtitle format.

.ass just gives you a lot of control over how the text is displayed. You can change the font of individual lines, color them, move them to a specific position, scale them bigger or smaller, add gradients to the text, fade effects, stuff like that. You can even draw shapes if you want to block some text on screen and add subtitles over that part.

Here's an example of something fancy I did back in the day, the "B&L's Style Romance!" text in that picture is 100% soft sub, you can toggle it on and off. It's 9 different subtitle lines(and possibly 2 more lines for the gradient and something else, forgot how that works 7 of which are to create the gradient effect, it only shows a slice of each line of different colors) stacked onto each other to mimic the style of the japanese title.
View attachment 3164793

Edit: One thing I forgot about that could be useful with .ass even if you only want basic text is that you can set an actor name for each line which doesn't get displayed into the subs so if you find it helpful to know who is saying what when editing, that's one thing .srt can't do.
Thanks for the great, comprehensive response.
Much appreciated.
I owe you.
 
  • Like
Reactions: Taako

Chuckie100

Well-Known Member
Sep 13, 2019
721
2,818

DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil

1avnsuorhjh0.jpg

I used Whisper to produce this subtitle file for DTKM-30. This is another mother-son swap JAV, staring two hot MILFs and where the son gets busted at the end! I had to keep updating the names as Whisper was not consistent. Still not sure if I got them totally correct! As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. Again, I don't understand Japanese or Chinese so my re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.​

 

Attachments

  • DTKM-030A.rar
    14.3 KB · Views: 209
  • DTKM-30B.rar
    9.1 KB · Views: 193

Chuckie100

Well-Known Member
Sep 13, 2019
721
2,818

JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura

juy887pl.jpg

I used Whisper to produce this subtitle file for JUY-886 starring Eriko Miura. As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. And Yea, I twisted it to be about a mother and her son! (You can untwist it be replacing "...father" with "...husband" and appropriate pronoun.) Again, I don't understand Japanese or Chinese so my other re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.​

 

Attachments

  • JUY-887.rar
    12.6 KB · Views: 258

Imscully

Well-Known Member
Apr 1, 2014
361
640

JUY-887 One Weekend I Lied To My Wife About Going On A Business Trip, And Once She Thought I Had Left, She Fucked The Shit Out Of The Guy She's Been Cheating With. I Stayed Hidden In The House The Whole Time And Saw Everything... Eriko Miura

juy887pl.jpg

I used Whisper to produce this subtitle file for JUY-886 starring Eriko Miura. As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. And Yea, I twisted it to be about a mother and her son! (You can untwist it be replacing "...father" with "...husband" and appropriate pronoun.) Again, I don't understand Japanese or Chinese so my other re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.​

Thanks, Chuckie!!!
 
  • Like
Reactions: Taako

Safadinho

New Member
Dec 26, 2021
20
10

DTKM-030 And Because I’ll Not Inspire My Mother, And Me Yarra Let Kimi Mom. Wada HyakuMika Okumura Pupil

1avnsuorhjh0.jpg

I used Whisper to produce this subtitle file for DTKM-30. This is another mother-son swap JAV, staring two hot MILFs and where the son gets busted at the end! I had to keep updating the names as Whisper was not consistent. Still not sure if I got them totally correct! As always however, I still had to clean it up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. Again, I don't understand Japanese or Chinese so my re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.​

Thank you very much for this. This looks really good.
 
Last edited:
  • Like
Reactions: Taako and Imscully
Feb 25, 2023
13
40
I purposefully included the address bar in the screenshot from the post I made above yours so people would know, but here it is anyway, it's the WhisperWithVAD link people keep posting here: https://colab.research.google.com/github/ANonEntity/WhisperWithVAD/blob/main/WhisperWithVAD.ipynb


And yeah, seems to be that way looking at the github page for the cpp port someone made, it probably doesn't include the internal ffmpeg processing yet that the python version has:
Code:
Note that the main example currently runs only with 16-bit WAV files, so make sure to convert your input before running the tool. For example, you can use ffmpeg like this:
ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le output.wav
You should also make it mono(-ac 1) unlike your previous example or else you're getting a file that's double the size for nothing.
I know this is a little old, but I've been running whisper on an mac M1. I experimented around with the different models and found the medium model to perform slightly better. Though the best translation I've found was from a fine-tuned model on hugging face that was specific for japanese. I'm pretty impressed with combining whisper (transcription) and deepl (translation).
 
Feb 25, 2023
13
40
I also arrived at the same conclusions. After extensive testing, I've settled on 3.2 chunk_threshold and 0.3 vad_threshold for a good balance between catching short sentences and natural sounding speech. Source separation also fails to work. The spleeter package is to blame.

I also use amplification. I amplify 1 db in Audacity and export as wav.


I have Whisper with VAD installed locally for long duration videos such as HUNTB, but I use the medium model with an RTX 3070 as large requires more VRAM. I also tested max_retries. One file gave me 256 chunks. 52 chunks (20%) needed a retry. Of those 52 chunks, 45 (87%) still failed with a max_attempts of 24. The rest succeeded within 3 retries. Conclusion, stick with a max_attempts value of 1, 2 or 3. I stick with no translation and just translate using DeepL. No one talked about the garbage_list. It'll save you so much time from cleaning up the subtitle. I added over 1000 Japanese words to the garbage list. https://drive.google.com/file/d/1AwJ-VdxA4jM6yJS37Tdxrbj1uAgaU08j/view?usp=share_link. Anything that sounds like laughing, screaming, moaning, etc. get deleted from the subtitle. The garbage list is not complete, there is always more to add. Also I changed the out_path filename so that it includes the settings used. No need to upload to the sidebar in Colab. If mounting your google drive, an example audio_path is "/content/drive/My Drive/temp/abc-123-1db.wav".

View attachment 3164884
View attachment 3164885
So I do something similar I have one list that are just strings to ignore. A lot of 'door closing', 'ghost screaming' or various sounds. I also have a two column file that i use for minor replacements, but yeh the file keeps getting larger and larger. I'll do an initial translation, and look at all the lines sorted in a text file and then copy the bad lines over to my text files i use to filter. I've noticed a lot of my bad lines have ()s in them so makes it a little easier. I use the medium model as well, not because of speed, but just because I noticed it tends to be more accurate. After processing with deepl I'll have 3 SRT files, one english, one japanese and one i leave as dual.
 

TmpGuy

JavLuv author, lesbian connoisseur
Jun 1, 2013
797
1,055
Here's another update of my "all subtitles ever posted to this thread" collection.

Since then, there have been 722 subtitles posted.
v4-akiba-import.png

Version 4 is now up to 33,066 subtitles. All files have been renamed, language IDs added, checked for issues, and sorted alphabetically by ID. Enjoy!

FileJoker:

Rapidgator:

Quick plug: JavLuv 1.1.22 was used to sort these subtitles, and can also match your movie collection to available subtitles as well. It's completely free and open source, and I support it here in these forums.
 

ericf

Well-Known Member
Jan 13, 2007
234
529
I've tried both stereo and mono and from what I can tell, I always get better results starting with a stereo file. It makes no difference if Whisper converts that stereo track into a mono track. My tests point to better results if the original track is in stereo. And for me there's pretty much no difference between two passes on the same file. I might get a couple more commas or full stops and one or two words in a few sentences will be different, but on the whole, I get the same content. If I change the Threshold, though, there are always clear differences. So, for me, I'll stick with my stereo uploads.
 

SamKook

Grand Wizard
Staff member
Super Moderator
Uploader
May 10, 2009
3,755
5,154
I've tried both stereo and mono and from what I can tell, I always get better results starting with a stereo file. It makes no difference if Whisper converts that stereo track into a mono track. My tests point to better results if the original track is in stereo. And for me there's pretty much no difference between two passes on the same file. I might get a couple more commas or full stops and one or two words in a few sentences will be different, but on the whole, I get the same content. If I change the Threshold, though, there are always clear differences. So, for me, I'll stick with my stereo uploads.
Have you tried making a mono file in the same way whisper does, with ffmpeg, to test it? Maybe whatever you're using is not using the same process and creating a different result or if you skip also making it 16k and leaving it at the original, it influence things.

Just curious because it makes no sense that it would be any different unless the resulting audio is altered in some way.

Edit: With the above, I assumed you used wav when making them mono but if that's not the case, then that's likely the problem right there, you're doing an extra lossy encoding pass so you're losing details more than if you didn't do anything.
 
Last edited:
  • Like
Reactions: ericf

lock_on

Member
Jan 22, 2023
42
4
Here's another update of my "all subtitles ever posted to this thread" collection.

Since then, there have been 722 subtitles posted.
View attachment 3168627

Version 4 is now up to 33,066 subtitles. All files have been renamed, language IDs added, checked for issues, and sorted alphabetically by ID. Enjoy!
Thanks, but do you have the updated version only? Because it takes time to download all over again only for duplicates.
 
  • Like
Reactions: Prinsipe

Prinsipe

Member
Aug 31, 2013
58
19
Thanks, but do you have the updated version only? Because it takes time to download all over again only for duplicates.
Yes i agree. It is also better to create a folder for new or additional subtitle only like what others are doing to avoid file duplication if we already had the other files before.
 

wwilk

Active Member
Nov 20, 2009
403
240
Thank you for doing this. I wish more were in English but oh well.
Here's another update of my "all subtitles ever posted to this thread" collection.

Since then, there have been 722 subtitles posted.
View attachment 3168627

Version 4 is now up to 33,066 subtitles. All files have been renamed, language IDs added, checked for issues, and sorted alphabetically by ID. Enjoy!

FileJoker:

Rapidgator:

Quick plug: JavLuv 1.1.22 was used to sort these subtitles, and can also match your movie collection to available subtitles as well. It's completely free and open source, and I support it here in these forums.