akiba resident JAV subtitlers & subtitle talk★NOT A SUB REQUEST THREAD★

mei2

Well-Known Member
Dec 6, 2018
242
403
.... I was able to find a few leftover words that I was planning to add to that list.
They are below.

I have begun another list, so I decided to share it before I accidentally delete it again :)

Thanks @Taako, this was very timely. I was looking for a glossary that I could use to help improve Whisper translations.
 
  • Like
Reactions: Taako and Imscully

TmpGuy

JavLuv author, lesbian connoisseur
Jun 1, 2013
742
988
I started a list of words and phrases, maybe 2 years ago, of commonly used words in JAV.
That list must have had over 3000 words and phrases, if not more.

It consisted of different combinations of words, idioms, some repetitive words, blah, blah,
with some help from native Japanese friends/speakers.

Remember, these are words I sometimes hear... and considering my hearing is bad lol,
I think you might hear them differently or know better words.

Share and share alike

I've used the same sort of hint sheet. Same caveats apply about correctness and accuracy. Here's mine (CSV format), in case it has anything you missed:
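For anyone wondering how a CSV sheet like this could be fed to a tool, here is a rough Python sketch. It assumes a hypothetical two-column layout with the headers `japanese` and `english`, which may not match the attached file. It only builds a short hint string from the Japanese terms. Both the open-source `whisper` package and faster-whisper accept an `initial_prompt` argument on `transcribe`, and a string like this is one thing people pass there, but whether it actually improves results is untested here.

```python
import csv
import io


def load_glossary(csv_text):
    """Parse glossary CSV text into a list of (japanese, english) pairs.

    The column names "japanese" and "english" are assumptions for this sketch.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    pairs = []
    for row in reader:
        jp = (row.get("japanese") or "").strip()
        en = (row.get("english") or "").strip()
        if jp and en:  # skip incomplete rows
            pairs.append((jp, en))
    return pairs


def build_hint_prompt(pairs, max_chars=200):
    """Join Japanese terms into one hint string, stopping at max_chars."""
    terms = []
    length = 0
    for jp, _ in pairs:
        extra = len(jp) + (1 if terms else 0)  # +1 for the separator
        if length + extra > max_chars:
            break
        terms.append(jp)
        length += extra
    return "、".join(terms)


# Tiny made-up sample in the assumed layout.
sample = "japanese,english\nうまい,good / delicious\nすごい,amazing\nそこ,right there\n"
pairs = load_glossary(sample)
prompt = build_hint_prompt(pairs)
print(prompt)
```

The `max_chars` cap is there because prompts have a limited budget, so a 3000-entry list would need trimming to the terms that matter most for a given video.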
 

Attachments

  • JAV Translation Notes.zip
    2.5 KB · Views: 91

Taako

Akiba Citizen
May 25, 2017
1,325
933
I can't wait to see your list. I'm sure it's awesome. :)
 

Taako

Akiba Citizen
May 25, 2017
1,325
933
Thanks @Taako, this was very timely. I was looking for a glossary that I could use to help improve Whisper translations.
You're welcome. My plan is to add to the list and... I hope others do as well.
This could make things easier for all of us subbers.
 

julian102

New Member
Feb 16, 2022
11
18
OK, I recently found https://www.nikse.dk/subtitleedit, which has a built-in Whisper option (I'm using the faster-whisper engine to transcribe faster), so I was basically able to one-click subtitle my JAV videos from audio only. Personally, I think the quality is very good for machine translation, and it at least gives a good gist of the general dialogue in all the videos. I'm going to post a couple of SRT files I've done with this. All of these took about 30-40 hours in total, I think.

I'll list them here in case anyone is searching for them. (For a minority of these, I cut out scenes, so the translation only covers those parts. If you use the SRT with the full video, it will not cover all of the audio.)

HUNTA-881, HUNBL-148, HUNTA-429, HUNTA-406, HUNTA-180, HUNTA-131, HUNBL-133, HUNBL-126, HUNBL-109, HUNBL-082, HUBLK-012, HUBLK-011, FSDSS-632, HND-395, HBAD-390, HBAD-351, HBAD-338, GVH-610, GVH-597, GVH-596, GVH-493, GVH-472, GVH-003, GVG-873, GS-2054, GDHH-200, DVRT-018, DVRT-017, DVDMS-075, DVDES-926, DVDES-692, DVDES-591, DSE-1120, DISM-028, DISM-017, DISM-011, DASS-244, DANDY-884, DANDY-858B, DANDY-596, DANDY-536, DANDY-505, DANDY-496, DANDY-335, DANDY-102, CRPD-436, CRPD-194, CAWD-539, CAWD-365, BWSD-23, ATOM-227B, ATOM-227A, ATID-535, AQSH-050, ALDN-227, HEYZO-0841, DVDMS-010, VRTM-171, NHDT-695
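
For anyone curious what the last step of a one-click workflow like the one above roughly looks like, here is a small Python sketch that turns timed segments into SRT text. This is not Subtitle Edit's or faster-whisper's code. The `(start, end, text)` tuple layout and both function names are assumptions made up for illustration.

```python
def format_timestamp(seconds):
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Render (start, end, text) tuples as SRT text with 1-based cue numbers."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


# Made-up segments, times in seconds.
demo = [(0.0, 2.5, "Oh, wow."), (2.5, 4.0, " Right there. ")]
print(segments_to_srt(demo))
```

Since the timestamps are relative to the start of whatever file was transcribed, an SRT made from a cut clip only lines up with that clip.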
 

Attachments

  • srt.rar
    1.1 MB · Views: 199

iko190806

New Member
Mar 16, 2024
11
3
Wow, this is a gold mine...
I started a list of words and phrases, maybe 2 years ago, of commonly used words in JAV.
That list must have had over 3000 words and phrases, if not more.

All of it was in hiragana.

It consisted of different combinations of words, idioms, some repetitive words, blah, blah,
with some help from native Japanese friends/speakers.

A couple of months ago, while using notepad 1, I accidentally deleted the entire thing.
It did not dawn on me until a few days later what I had done.
I was more angry with myself than sad.

So I was able to find a few leftover words that I was planning to add to that list.
They are below.

I have begun another list, so I decided to share it before I accidentally delete it again.

What I present to you will not be 100% accurate and you are
free to change and/or add to the list. So others can use it as well.

Also, while it's good to have different meanings for a word/phrase, it can be
frustrating to know which is more correct. So I try to keep it as simple as I can :)

Remember, these are words I sometimes hear... and considering my hearing is bad lol,
I think you might hear them differently or know better words.

Share and share alike.
 
  • Haha
Reactions: Taako

Taako

Akiba Citizen
May 25, 2017
1,325
933
While I'm working on more subs, here is a useful word.
I hear it from men in JAV more than from women, but women say it too :)

1. Umai(うまい) = Good. It's good. It's delicious. Tasty, etc.
(E.g., good at something: You're 'good' at sports.)
(E.g., as a synonym of 美味しい = delicious: The meal is 'delicious'.)


One more for now... and believe me it is used in JAV, although randomly.
So add it:p

1. Nande sore mo anna jikan ga saisho ni itta hou ga ii
(なんでそれもあんな時間が最初に言った方がいい) =
'But why it should be better at that time to begin with'.


Ok, that was a long one, how about two more... we all hear a lot...

1. Ō, Sugoi = Oh (Ah), Sugoi (おおすごい) =
Oh, wow. Wow. Oh, great. Amazing.
Wow, it's amazing. That is amazing, etc.

2. Soko(そこ) = There. Here. That place. Right there.
That part, etc.

That's it for now.
Remember, if some words are wrong or you want to add subs, please do.
Let's all make a list.
 
  • Like
Reactions: iko190806 and mei2

mei2

Well-Known Member
Dec 6, 2018
242
403
This is a cross-post from another thread:

Some time ago I started collecting any JAV glossary that I could find. Here is the collection so far:

JAV Glossary Version 3

Note: I don't speak Japanese and have not vetted the correctness of all entries that I have collected.
Note: there are duplicates that I haven't cleaned up yet.
Note: if you detect any errors please let me know.
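
For the duplicate cleanup mentioned in the notes above, something along these lines might help. It is only a sketch: it assumes entries are simple `(term, meaning)` pairs, which may not match how the collected glossary is actually laid out, and the function names are made up. Terms are compared after Unicode NFKC normalization so that full-width and half-width spellings count as the same entry.

```python
import unicodedata


def normalize_term(term):
    """Normalize a term for comparison: NFKC form, trimmed, lowercased."""
    return unicodedata.normalize("NFKC", term).strip().lower()


def merge_glossary(entries):
    """Merge (term, meaning) pairs, keeping each distinct meaning once per term."""
    merged = {}  # dicts keep insertion order, so first-seen order is preserved
    for term, meaning in entries:
        key = normalize_term(term)
        meaning = meaning.strip()
        bucket = merged.setdefault(key, [])
        # Compare meanings case-insensitively so "There" and "there" collapse.
        if meaning and meaning.lower() not in (m.lower() for m in bucket):
            bucket.append(meaning)
    return merged


# Made-up entries with the kinds of duplicates a merged list tends to have.
entries = [
    ("そこ", "There"),
    ("そこ ", "Right there"),
    ("そこ", "there"),
    ("Umai", "Good"),
    ("umai", "Delicious"),
]
print(merge_glossary(entries))
```

Keeping every distinct meaning instead of picking one leaves the choice of the "more correct" translation to whoever reviews the list.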
 

Fetterbr

Member
Dec 10, 2010
84
5
How do you use this in a translator? I use the Subtitle Edit app.
 

joowonton

New Member
Jan 28, 2024
3
8
Hey all! I'm new here and I was wondering if anyone wants to partner with me in developing high-quality English subtitles for JAV videos.

A bit about me: I work professionally as a software engineer, mostly on the backend, so I write data pipelines for a living. I was wondering if any of you are well versed in ML/AI so we can fine-tune Whisper models.

I can create web tools and infrastructure to support the research. For example, I created https://www.simplejav.com/ (work in progress) so users can easily watch JAV on mobile and have access to English subtitles. (Currently it just uses plain Whisper AI for the English subtitles, so it's pretty rudimentary and inaccurate, but it's better than nothing.)

Or if you speak Japanese, let's collaborate :D. I am looking for any help I can get. I want to create the best experience for watching JAV as an English speaker.
 

mei2

Well-Known Member
Dec 6, 2018
242
403
Hey all! I'm new here and I was wondering if any one wants to partner with me in developing high quality english subtitles for JAV videos.

…. I was wondering if any one of you is well versed in ML/AI so we can fine tune whisper models.…

Hi @joowonton, I’m not an ML engineer but I’ve done a fair bit of playing around with Whisper. I’d be keen on any collaboration to improve and automate for better quality subs. My public repo is here: WhisperJAV.

What I’ve found most difficult in improving subs (and fine-tuning Whisper) is having a reliable test data set. I have tried to create my own data set from subbed anime and Netflix, but none are at a satisfactory level. The best so far has been Netflix’s Naked Director—I specifically use the episode where Nanami Kawakami appears, as her voice is quite the same as in her JAV works (and she is my top favorite ;)).

I personally think, to begin with, the most valuable partner would be a Japanese-speaking person who could help (you/us) in creating/validating a test data set. For example, a set of 5-minute audio/subs from the top 20 actresses each would go a long way. So I hope some Japanese-speaking member here will reply to your call.
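
To make the point about a test data set a bit more concrete, here is a minimal sketch of how validated reference lines could be used to score a Japanese transcript. It uses character error rate (CER), a common metric for languages written without spaces. This is not taken from the WhisperJAV repo, and the function names are just illustrative.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(ref, hyp):
    """Edit distance divided by reference length; 0.0 means a perfect match."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Made-up example: the hypothesis dropped half of the reference.
print(character_error_rate("そこそこ", "そこ"))
```

With a validated set, running the same audio through two Whisper configurations and comparing their average CER would give a number to argue about instead of impressions.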

Hope to hear from other members.
 
  • Like
Reactions: Taako and Imscully

joowonton

New Member
Jan 28, 2024
3
8
Yeah, that's what I thought. I'm excited to check out your repo and see the work you have done so far. I know there are some phrases in a public dataset we might be able to translate and then use as training data:

Though I'm not sure how relevant these phrases are to JAV. I've DMed you about some other possible techniques, so I won't bore this thread.
 
  • Like
Reactions: Taako and mei2

stanocrick

New Member
Aug 5, 2024
2
0
Hi... why is Subtitle Edit so horrible at generating subtitles? And translation makes it even worse. For example, it can't generate subtitles if there is background music. Can you suggest alternative applications?