Post your JAV subtitle files here - JAV Subtitle Repository (JSP)★NOT A SUB REQUEST THREAD★

maload

Active Member
Jul 1, 2008
697
145
Here is a machine generated sub for Yua Mikami SSIS-477. I'm using a new Japanese app --it looks promising so far. Better than Autosub and PyTranscribe, but still not perfect.

View attachment 3005563
can you give me japanese text version of this .srt please
i want to read it
 

ericf

Well-Known Member
Jan 13, 2007
234
529
Thanks for posting that links. I don't use that kind of script language a lot. Do I just copy the section up top that begins with:
"from huggingsound import SpeechRecognitionModel"
and input the full input/output paths to the audio in the command prompt and it will do its work?
I don't see any info on having to download any files.

Also, it says it needs input of 16 kHz sample rate. So you would need to convert every audio file in Audacity to 16 kHz.
Is there a runtime limit to the audio? Vrew has a 2 hr limit and I know pyTranscriber doesn't like long files either.

Once again, thank you for letting us know there are more options out there.
I do remember someone else mentioning a 'new' method of extracting text from audio in THIS thread. Never found a more detailed explanation.
 

mei2

Well-Known Member
Dec 6, 2018
246
406
can you give me japanese text version of this .srt please
i want to read it

Here you go, here is the Japanese text.
PS. the timing is off --there was a error in split and merge recalculation.
 

Attachments

  • SSIS-477.ja-JP (STT) v1.zip
    13.5 KB · Views: 208
  • Like
Reactions: xsf27

mei2

Well-Known Member
Dec 6, 2018
246
406
Thanks for posting that links. I don't use that kind of script language a lot. Do I just copy the section up top that begins with:
"from huggingsound import SpeechRecognitionModel"
and input the full input/output paths to the audio in the command prompt and it will do its work?
I don't see any info on having to download any files.

Also, it says it needs input of 16 kHz sample rate. So you would need to convert every audio file in Audacity to 16 kHz.
Is there a runtime limit to the audio? Vrew has a 2 hr limit and I know pyTranscriber doesn't like long files either.

Once again, thank you for letting us know there are more options out there.
I do remember someone else mentioning a 'new' method of extracting text from audio in THIS thread. Never found a more detailed explanation.
I presume you aready have your python environemnt set up, then you'd only need to install and import the modules.

Both the input and output need pre and post work. And you're right, using Audacity for 16khz would do the job. I use ffmpeg for the same. I cut the input to 30min chunks. I'm still looking for a pre-trained (Japanese) tokenizer for preprocessing.

For pyTranscriber I usually split the input to 5min chunks before feeding it to pyTransciber.

PS. I also noticed that the results are better when audio is converted from streo to mono.
PPS. Somewhere I read that Vrew gives better results if the audio is converted to female voice, and the volume is increased 5x. I haven't tried it though.
 

ericf

Well-Known Member
Jan 13, 2007
234
529
I have used Vrew a little and it finds more dialog than pyTranscriber (autosub) but it's not always more accurate. I thought it was more accurate with Korean since it uses a dedicated Korean api but I guess Google is superior in this case.
How would one convert audio to female voice? Higher pitch? I know that amplifying works only to a certain degree with Google API/Autosub and then it drops a lot of the dialog. It just doesn't recognize it. I've always wondered why that didn't work so well. I can't do any adjustments in the API, so doing adjustments to the audio should help. I've tried compression as well. I tend to do at least 3 versions of the audio and then I compare and choose the best versions and copy missing lines between scripts.
With mono tracks, are you saying that works better with pyTranscrier? I've tried a few times but have not had any difference between stereo and mono audio.
 
Last edited:

mei2

Well-Known Member
Dec 6, 2018
246
406
I have used Vrew a little and it finds more dialog than pyTranscriber (autosub) but it's not always more accurate. I thought it was more accurate with Korean since it uses a dedicated Korean api but I guess Google is superior in this case.
How would one convert audio to female voice? Higher pitch? I know that amplifying works only to a certain degree with Google API/Autosub and then it drops a lot of the dialog. It just doesn't recognize it. I've always wondered why that didn't work so well. I can't do any adjustments in the API, so doing adjustments to the audio should help. I've tried compression as well. I tend to do at least 3 versions of the audio and then I compare and choose the best versions and copy missing lines between scripts.
With mono tracks, are you saying that works better with pyTranscrier? I've tried a few times but have not had any difference between stereo and mono audio.

I haven't tried the voice gender conversion --it is indeed to shift the pitch, I understand. Here is a discussion that seems to point on Audacity plugin:

https://www.reddit.com/r/kdenlive/comments/jnt4vf
I'm impressed with you method to create 3 versions -- well done!

I will give Vrew a try. I wonder if it has a capability to learn/improve. I read that Vrew records the users activities. I wonder if that is for the purpose of improving the algorithm.
 
  • Like
Reactions: ericf

maload

Active Member
Jul 1, 2008
697
145
Here you go, here is the Japanese text.
PS. the timing is off --there was a error in split and merge recalculation.
thank you
 

ericf

Well-Known Member
Jan 13, 2007
234
529
Just a quick update: Don't use downsampled Mono with Vrew. I've only checked the Korean language extractor, but if you make it mono and 16Hz you'll get roughly a third less text in the file. I tried a mono file at 48Hz and it worked almost as well as an unadjusted stereo track. Amplify it in parts and you might catch some more dialog but the program tends to interpret amped content differently. Also, I tried a compressed file (Threshold -16db, Noise Floor: -35 db) and didn't see any real improvement.
All attempts at removing all sounds but the dialog have resulted in sound with a changed character. Either muffled or 'glitchy' and clipped. There is background noise some of the time but the most difficult parts are when there's music that's as loud as the dialog.
 
  • Like
Reactions: mei2

Taako

Akiba Citizen
May 25, 2017
1,335
940
Looks like someone is taking subs here and posting to their site or what ever. And they do not give credit to the person who created the sub(s) lol :rolleyes:

My recent sub of GG-016 was found here, https://javluna1.com/jav-english-su...-during-a-visit-to-my-parents-momoka-nishina/

And if the person had took the sub had read the summary, they would know...
I post only the short version here https://www.akiba-online.com/thread...not-a-sub-request-thread.1466451/post-4552511.

y1.jpgy2.jpgy3.jpgNOT the full sub. Which I decided not too lol

And the admin there say he created it hahaha :hehe:
 
Last edited:
  • Like
  • Wow
Reactions: xsf27 and Imscully

maload

Active Member
Jul 1, 2008
697
145
Looks like someone is taking subs here and posting to their site or what ever. And they do not give credit to the person who created the sub(s) lol :rolleyes:

My recent sub of GG-016 was found here, https://javluna1.com/jav-english-su...-during-a-visit-to-my-parents-momoka-nishina/

And if the person had took the sub had read the summary, they would know...
I post only the short version here https://www.akiba-online.com/thread...not-a-sub-request-thread.1466451/post-4552511.

View attachment 3007847View attachment 3007848View attachment 3007849NOT the full sub. Which I decided not too lol

And the admin there say he created it hahaha :hehe:
i do hardsub and after i share it on torrent community for 1 day
all my hardsub have been found on 2 websites
funny as hell, they make money from my hard work.
there is seriouse talking about this in my little forum.
somebody said when i share it on the net so then its free for eveybody.
5 years ago , i make subtitle from english subtitle ,
its not free , we hired people who know japanese for making subtitles.
i do it for my friends in that forum.
how come they said its free for everybody.
nowsday...
its worst.
somebody just uoload thing on the net because they can make money from it
4 posts for 1 dollar and such .
i dont think this people are websites owner.
i guess they dont even watch what they upload on the net because they dont have time for that .
thing as,,, they are sitting here and that forum .... looking at that topic every 3 hour and got some subtitle ....
 

Taako

Akiba Citizen
May 25, 2017
1,335
940
i do hardsub and after i share it on torrent community for 1 day
all my hardsub have been found on 2 websites
funny as hell, they make money from my hard work.
there is seriouse talking about this in my little forum.
somebody said when i share it on the net so then its free for eveybody.
5 years ago , i make subtitle from english subtitle ,
its not free , we hired people who know japanese for making subtitles.
i do it for my friends in that forum.
how come they said its free for everybody.
nowsday...
its worst.
somebody just uoload thing on the net because they can make money from it
4 posts for 1 dollar and such .
i dont think this people are websites owner.
i guess they dont even watch what they upload on the net because they dont have time for that .
thing as,,, they are sitting here and that forum .... looking at that topic every 3 hour and got some subtitle ....
Yes. So somebody here is taking subs and trying to make money here. I don't care but I won't help them.

That's why I don't share full subs anymore.
I only share subs that I clean up, find, and I ALWAYS give credit when I can.

My personal subs only go to people I know and trust :)
 
  • Like
Reactions: CoolKevin and mei2

Imscully

Well-Known Member
Apr 1, 2014
361
640
Looks like someone is taking subs here and posting to their site or what ever. And they do not give credit to the person who created the sub(s) lol :rolleyes:

My recent sub of GG-016 was found here, https://javluna1.com/jav-english-su...-during-a-visit-to-my-parents-momoka-nishina/

And if the person had took the sub had read the summary, they would know...
I post only the short version here https://www.akiba-online.com/thread...not-a-sub-request-thread.1466451/post-4552511.

View attachment 3007847View attachment 3007848View attachment 3007849NOT the full sub. Which I decided not too lol

And the admin there say he created it hahaha :hehe:
I have become increasingly disturbed by sites that steal others work without even giving credit or reference to the original author.
 

Chuckie100

Well-Known Member
Sep 13, 2019
721
2,818
venu790pl.jpg


I found machine subs for VENU-790 starring Chisato Shouda. The sub was pretty good for a machine translation. I cleaned it up a bit and tried to interrupt what was being said on some lines, but otherwise did not try to twist the story-line. Anyway enjoy and let me know what you think.
 

Attachments

  • VENU-790 Chisato.rar
    7.9 KB · Views: 294

Taako

Akiba Citizen
May 25, 2017
1,335
940
I have become increasingly disturbed by sites that steal others work without even giving credit or reference to the original author.
Yes, its disturbing and really there's no excuse, unless the original author can not be found.

I would not care if money wasn't involved, but clearly some bad guys are taking other ppl work and using it to make money. They think because its the internet they are free to break the laws.

But actually its illegal. I don't point fingers but at least I'm not trying to make money by stealing.
 
  • Like
Reactions: Imscully

mei2

Well-Known Member
Dec 6, 2018
246
406
Just a quick update: Don't use downsampled Mono with Vrew. I've only checked the Korean language extractor, but if you make it mono and 16Hz you'll get roughly a third less text in the file. I tried a mono file at 48Hz and it worked almost as well as an unadjusted stereo track. Amplify it in parts and you might catch some more dialog but the program tends to interpret amped content differently. Also, I tried a compressed file (Threshold -16db, Noise Floor: -35 db) and didn't see any real improvement.
All attempts at removing all sounds but the dialog have resulted in sound with a changed character. Either muffled or 'glitchy' and clipped. There is background noise some of the time but the most difficult parts are when there's music that's as loud as the dialog.
Thank you. Very helpful !
 
  • Like
Reactions: ericf

Imscully

Well-Known Member
Apr 1, 2014
361
640
Yes, its disturbing and really there's no excuse, unless the original author can not be found.

I would not care if money wasn't involved, but clearly some bad guys are taking other ppl work and using it to make money. They think because its the internet they are free to break the laws.

But actually its illegal. I don't point fingers but at least I'm not trying to make money by stealing.
Exactly.
I'm still a bit perturbed that new-jav blatantly stole my 100% original project for BKD-97, removed my name, and posted it on their site which generates revenue from use of filejoker links and sales.
It's important to remember that I published it in 2020 after having worked on it on and off for 2-3 years.
I don't know Japanese, so I created every line of that subtitle project myself. I didn't use translation software or Google translate. I evaluated and re-evaluated every word they said in combination with their tone of voice, facial expressions, other body language signs, and fit within the obvious greater theme of the movie.
I have no idea exactly how many hours I put into that project, but I can safely say that it was well over 100, if not 200 total hours. I loved the movie so much that I kept going back over each and every line to ensure that it was all coherent and consistent. That's not easy. It's actually really hard, but I enjoy the challenge. It's like putting a puzzle that has many interchangeable pieces together. One wrong piece can send you back to the drawing board.
I don't mean to complain, as I've said many times that I am happy to know that people are enjoying my art, but it does frost my tail to know that they are making money off of it and that they INTENTIONALLY removed any sign of the original source.
I may have to start putting a tag or key phrase known only to me into each project. I could just go the route of hard coding them into movie files, but that has it's own set of downsides.

Anyway....I have continued to work on a variety of movies, but haven't buckled down to actually finish one to perfection since SDMF-19. I really need to just pick one and knock it out to share with all my perverted friends here. Earlier today I was working to piggy back on Taako's partial translation of JUC-514. It's a really hot movie, so I think I will devote a few hours to get that done in the next day or two. (While giving credit to my friend Taako for his initial work and inspiration.)
I appreciate all those who contribute to this fun, pervy forum.
Cheers.
 
Last edited:

CoolKevin

Nutcase on the loose
Staff member
Super Moderator
Mar 30, 2007
10,005
3,633
I have become increasingly disturbed by sites that steal others work without even giving credit or reference to the original author.
It has been going on since the internet started, I might even be able to make a single thread with a million + complaints from this site alone

sadly, it will always happen
 
  • Like
Reactions: Taako and Imscully

SamKook

Grand Wizard
Staff member
Super Moderator
Uploader
May 10, 2009
3,755
5,154
If you do something for fun or a small community and publish it on some public forum without any kind of license, then it is perfectly legal for anyone else on the internet to use without credit, even if it's very rude.

You also do need to realize that it's kind of hypocritical to complain about it since most of us are likely doing the same thing with JAV videos, downloading them for free and potentially earning from them for those who share back.

I spent a few years in an anime fansubbing group so I know how much work goes into making subs but you just gotta take it as more people get to enjoy your work and the content it's for, even if you don't get the credit. Most people wouldn't know who you are anyway outside of this forum anyway.

Do what you do for yourself and the few others that appreciate it and don't bother with anything else as long as those few appreciate it and you're having fun/learning from it is my personal philosophy. As long as you're not doing it for profits, nothing changes for you other than more people get to enjoy your work.