Post your JAV subtitle files here - JAV Subtitle Repository (JSP)★NOT A SUB REQUEST THREAD★

porgate55555

Active Member
Jul 24, 2021
51
163
ffmpeg can do some processing on audio as it extracts it from a video file. I am currently using this command that will find all video files in a directory and then write out an mp3 for each. It also does a dynamic normalization to increase the volume of quiet parts of the track. (This is a bash shell command.)

Code:
find . -type f -iregex '.*\.\(m4v\|mp4\|mov\|wmv\|avi\|mpg\|mpeg\|rmvb\|rm\|flv\|asf\|mkv\|webm\)' -print0 | parallel -0 ffmpeg -loglevel 0 -i {} -af "dynaudnorm=f=150" -ac 1 -ar 16000 -vn {.}.mp3

There are other ffmpeg settings that can do dynamic range compression and other fancy things.
https://ffmpeg.org/ffmpeg-all.html#dynaudnorm

I have had very good luck with the above, so I'm not seeing much need to try to isolate voices from the regular audio track.
Sounds interesting. Do you know what the command should look like for command-line users like me?
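
For anyone without bash, `find`, and GNU `parallel`, here is a rough Python sketch of the same idea. It only reuses the ffmpeg flags from the command quoted above. The helper names (`build_ffmpeg_cmd`, `find_videos`, `convert_all`) are made up for illustration, and it assumes `ffmpeg` is on your PATH. Unlike `parallel`, it converts one file at a time.

```python
from pathlib import Path
import subprocess

# Same extensions as the regex in the bash command above.
VIDEO_EXTS = {".m4v", ".mp4", ".mov", ".wmv", ".avi", ".mpg", ".mpeg",
              ".rmvb", ".rm", ".flv", ".asf", ".mkv", ".webm"}


def build_ffmpeg_cmd(video):
    """Return the ffmpeg argument list for one video file (hypothetical helper)."""
    video = Path(video)
    return [
        "ffmpeg", "-loglevel", "0",
        "-i", str(video),
        "-af", "dynaudnorm=f=150",       # dynamic normalization filter from the post
        "-ac", "1",                      # one audio channel (mono)
        "-ar", "16000",                  # 16 kHz sample rate
        "-vn",                           # no video in the output
        str(video.with_suffix(".mp3")),  # same name, .mp3 extension
    ]


def find_videos(root):
    """Recursively list video files under root, matching extensions case-insensitively."""
    return sorted(p for p in Path(root).rglob("*")
                  if p.is_file() and p.suffix.lower() in VIDEO_EXTS)


def convert_all(root="."):
    """Run ffmpeg on every video under root, one after another."""
    for video in find_videos(root):
        subprocess.run(build_ffmpeg_cmd(video), check=True)
```

Calling `convert_all(".")` from the folder that holds the videos should write an mp3 next to each one. Passing the arguments as a list avoids shell quoting problems with spaces in file names.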
 

SamKook

Grand Wizard
Staff member
Super Moderator
Uploader
May 10, 2009
3,737
5,131
Haven't tested it, but you can edit the code in those Colab notebooks, and the VAD one already has a list of words it replaces, so nothing would stop anyone from adding their own list of words to be replaced there.
 

superman4207

JAV Perv Enthusiast
Jul 4, 2022
49
88
Just a tip on Whisper and translation: don't use it. Just choose No Translation and do the translation later in Google Translate or DeepL (use Tor Browser so you don't get blocked for using up your free quota of 500,000 characters a month). You will need to separate the text from the timing first and then re-attach it, as some of the translators introduce glitches into it.
I don't know how good the translator Whisper uses is (you can choose DeepL, too) but if you don't have a basic reference to what is actually transcribed, you can't figure out what the line is if the translator produces garbage.
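
The "separate the text from the timing, then re-attach it" step can be sketched in Python roughly like this. It is a minimal sketch, not a tested tool: the function names `split_srt` and `join_srt` are made up, it assumes a plain well-formed .srt, and it flattens multi-line cues into one line each so the translator gets one line per cue.

```python
import re

# Matches an SRT timing line such as "00:00:01,000 --> 00:00:02,500".
TIMING = re.compile(r"\d{2}:\d{2}:\d{2},\d{3} --> \d{2}:\d{2}:\d{2},\d{3}")


def split_srt(srt_text):
    """Split SRT text into two parallel lists: timing lines and cue texts."""
    timings, texts = [], []
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = block.splitlines()
        # Find the timing line; the cue number (if any) comes before it.
        idx = next((i for i, ln in enumerate(lines) if TIMING.search(ln)), None)
        if idx is None:
            continue  # not a cue, skip it
        timings.append(lines[idx].strip())
        # Flatten multi-line cues to a single line per cue.
        texts.append(" ".join(ln.strip() for ln in lines[idx + 1:]))
    return timings, texts


def join_srt(timings, texts):
    """Re-attach (translated) texts to the original timings and renumber the cues."""
    if len(timings) != len(texts):
        raise ValueError("translated line count does not match cue count")
    cues = [f"{n}\n{t}\n{x}"
            for n, (t, x) in enumerate(zip(timings, texts), start=1)]
    return "\n\n".join(cues) + "\n"
```

The idea is to paste `"\n".join(texts)` into the translator, split the result back into lines, and pass those lines to `join_srt`. The length check catches the case where the translator merged or dropped a line, which is the kind of glitch mentioned above.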

I can only use the Colab page, so many of the 'settings' mentioned in earlier posts can't be used. I have experimented with VAD-Threshold and have settled on 0.3. What does Chunk_Threshold (3.0) do? Is it the size of the audio parts analyzed? I'll try lowering and raising the number to see if there are improvements. Source separation didn't work for me; I get error messages.
If you're just using Whisper for the transcribing into Japanese and avoiding the translation, you can drop the .srt into the free Subtitle Edit program and easily auto-translate it. I believe it uses Google to translate languages. I have been doing this for years with the Chinese subtitle files I could find and translating them to English. It was my go-to method before Whisper came around.
 

Chuckie100

Well-Known Member
Sep 13, 2019
710
2,768

DVDMS-913 A Busty Wife Who Turned Into A Sow Semen Tank In Just 3 Days Yuria Yoshine

I used Whisper to produce this subtitle file for DVDMS-913. A little different story line than I prefer but nice boobs. As always, however, I still had to clean it up a bit and reinterpret some of the meaningless dialog. Again, I don't understand Japanese or Chinese, so my reinterpretations might not be totally accurate, but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

 

Attachments

  • DVDMS-913 - Copy.rar
    16.9 KB · Views: 362

panop857

Active Member
Sep 11, 2011
168
237
I think Whisper Transcribe + Google Translate is a sidegrade. I'm not seeing a consistent advantage to it. Some lines are better, some are worse. An adjusted Whisper model will likely beat both.
 

Chuckie100

Well-Known Member
Sep 13, 2019
710
2,768
If you're just using Whisper for the transcribing into Japanese and avoiding the translation, you can drop the .srt into the free Subtitle Edit program and easily auto-translate it. I believe it uses Google to translate languages. I have been doing this for years with the Chinese subtitle files I could find and translating them to English. It was my go-to method before Whisper came around.
How do you turn off the translation in Whisper? Does Subtitle Edit also have a limited "Lewd" vocabulary like Whisper?
 

Chuckie100

Well-Known Member
Sep 13, 2019
710
2,768

VOSS-143 The Bride Left The Letter And Suddenly Left Home! "My Mother-in-law, Who Has Been Worried About Me Who Is Depressed From Now On, Should Stay At Home ... But I Forgot To See Her Almost Nakedly In The Bath After Walking Up The House, Forgetting Her Bride For Some Time It Was Attacked And Put Out In The Middle! !6

I used Whisper to produce this subtitle file for VOSS-143 starring Eriko Miura. Perhaps a little unbelievable story line, but I really like this actress. As always, however, I still had to clean it up a bit and reinterpret some of the meaningless/"lewd-less" dialog. Again, I don't understand Japanese or Chinese, so my reinterpretations might not be totally accurate, but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

 

Attachments

  • VOSS-143.rar
    9.5 KB · Views: 245

Makkdom

Well-Known Member
Mar 4, 2019
157
388
And yes, lol, the lewd language barrier... It isn't entirely a deal breaker to me. I've been trying to learn Japanese and had in the past tried to sub videos on my own, but man, that process takes days or weeks for someone like me who doesn't know Japanese super well.

The way I look at it, optimistically, is that I am very familiar with the lewd Japanese words so I can easily go back and clean those parts up if I want to, but I don't really need the subs since I already understand what they are saying. Most of the subs I am searching for in a video are in the conversational foreplay/plot to heighten the sex, because nobody does porn plots like the Japanese, lmao. Now I just have to remember all the videos I've been desperate for subs for...
I get your point about a lot of the lewd language during sex scenes being nonessential. Like you, I feel the context around the actual sex is much more important. Mark Twain once wrote, “The difference between the almost right word and the right word is really a large matter. 'tis the difference between the lightning bug and the lightning.” And sometimes the same is true in translating JAV dialogue. For example, the recent Whisper file I worked on for ATID-497 had the following for the main character's self-reflection after cheating on her husband: "I'm the worst. I kiss my husband with the same mouth as the other men." What she actually says is closer to, "I'm the worst. I kiss my husband with the same mouth that drank another man's sperm." Quite a difference.
 

Chuckie100

Well-Known Member
Sep 13, 2019
710
2,768
WTF? Is Whisper listening and getting lewd-er? I just had Whisper subtitle VENUE-930 and got this:

595
02:12:01,074 --> 02:12:04,738
Takeshi...

596
02:12:06,738 --> 02:12:12,738
From now on, will you attack my nipples and pussies?

597
02:12:12,738 --> 02:12:16,738
Will you attack me and make me feel good?

598
02:12:17,738 --> 02:12:19,738
I understand.

I love AI! lol
 

Zephlol

Member
Oct 2, 2022
20
36
[Cross-posted from the "Gokkun" thread.]

VOLUNTEER TIMER WANTED

While I’m still interested in hobby subtitling as a fan, I simply don’t have the time to perform each role in the fansubbing process myself anymore. So, I’m posting to gauge interest in forming a small group to fansub both censored and uncensored gokkun, bukkake, and nakadashi videos as a team. It only takes one person to get the ball rolling, so I’m looking for at least one volunteer timer to join me.

The role of a timer would involve taking a bilingual script and producing an srt file in a subtitling program by determining the exact frames subtitles begin and end according to my own guidelines. It’s meticulous and tedious work, but Japanese language skills aren’t strictly required, and there are a number of free, cross-platform subtitling programs available. The volume is unlikely to exceed thirty minutes of video a month, as my time is limited and I’m only interested in producing high quality fansubs to begin with. As a collaborator, you would also share say in what we fansub, as well as how and when.

If you’ve ever wished there were more fansubs in these genres available but lack the translation skills to produce them yourself, this would be a good opportunity to make a meaningful contribution beyond your own language abilities. Send me a DM if you’re interested and we can explore the possibilities.*

*And if you can’t be bothered to pitch in, don’t ask me to work on the weekends for free.
For subtitle timing, I usually just upload the video to YouTube or use Adobe Premiere. Their automatic subtitles aren't great, but they give you accurate timing for the subs automatically.
 

Imscully

Well-Known Member
Apr 1, 2014
359
636
Will you attack me and make me feel good?
Yeah, that's ok, I guess.
But...
I still prefer "Hey Mom...MAKE FOOD!"
(I swear, the whole "Make food" idea has had me laughing all week.)
 

panop857

Active Member
Sep 11, 2011
168
237
It is worth messing around with the --suppress_tokens option to see if this can be adjusted. An argument of "food" or ["food"] (not sure when to use which) should make it avoid using the word food, or similar words.
 

r00g

Member
Jul 25, 2009
84
70
I've had no luck specifying --suppress_tokens as just comma-separated words; whisper just refuses to run. I think the words have to be "tokenized" via some other process and turned into integers. If anyone has better luck getting that working, I would love to know how to do it. That said, I did have some success using --initial_prompt: my transcripts are ever so slightly naughtier.

Code:
--initial_prompt 'cock pussy dick "deep throat" throbbing throb shaking cum come slut slick coming throat wet good very load balls lick ikku kimochi clitoris clit orgasm suck tits boobs breasts huge masturbate "fuck buddy" zubozubo hard binbin cunt fuck fucking ass crack lips' \
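
On the --suppress_tokens guess above: the option does expect integer token IDs rather than words, so the words need to go through Whisper's tokenizer first. Below is a rough Python sketch of that conversion. The function name `suppress_tokens_arg` is made up, and the `encode` argument is a stand-in for whatever tokenizer you have, so the logic can be tried without Whisper installed. The commented usage with `whisper.tokenizer.get_tokenizer` is an assumption about the openai-whisper package and has not been run here.

```python
def suppress_tokens_arg(words, encode):
    """Build a comma-separated token-ID string for the given words.

    `encode` is any callable that maps a string to a list of integer token IDs.
    """
    ids = []
    for word in words:
        # Tokenizers usually encode "food" and " food" (with a leading space)
        # differently, so both spellings are included.
        for variant in (word, " " + word):
            for tok in encode(variant):
                if tok not in ids:
                    ids.append(tok)
    return ",".join(str(i) for i in ids)


# Assumed usage with the openai-whisper package installed (not verified here):
#   from whisper.tokenizer import get_tokenizer
#   tok = get_tokenizer(multilingual=True)
#   print(suppress_tokens_arg(["food"], tok.encode))
```

The printed string would be what goes after --suppress_tokens. One caveat: a word that splits into several tokens gets each piece suppressed everywhere, which can damage unrelated words, so this is probably only safe for words that encode to a single token.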
 

avatarthe

Well-Known Member
Feb 1, 2008
184
282
Thanks for looking into it. That was exactly the issue. It merges the file, but combines the last 2 rows of a chunk together, so the whole thing is not usable. Thanks anyways :)

Edit: I was able to fix it after 3 hours of reverse engineering. Man, that was a pain, but it was worth the effort. Your improvements also make it better. Thanks again!
I too am using a program to translate with DeepL (deepl-srt), and it recently broke, likely due to the same changes to the DeepL site. I tried to make the suggested changes to the program but couldn't find the lines; perhaps it's a different program?
Either way, would you mind posting the corrected version of your program so we can all use it?

Thanks.