I'm a software engineer with nearly a decade of experience. Today I figured out how to use Google Cloud to transcribe and translate JAVs pretty reliably, using an open source project I found on GitHub.
Rough estimates but...
From start to finish it takes me about 90 seconds of manual manipulation and perhaps 10 minutes of waiting to generate an SRT file.
If I batch translate multiple at once, I can do them one after the other.
I'm still on Google Cloud's free trial, which lasts 90 days and offers $300 of credit, but by my rough estimate it costs maybe $0.10-0.25 USD in API costs to translate a 2 hour video. Totally pulling that number out of my ass but the point is it's cheap.
Hope to record and show a demo of this soon. I know I have like zero presence on this Forum but I've had this account for a long time and built most of my JAV collection here. So... feel like sharing in some of the amazing good fortune.
Update 1 - 2022-09-10
It's late and I got way too into this tool. I translated about 70-80 hours or so of JAV material and it cost me about $21.80 in credits. That averages about $0.30 USD an hour. So, my estimate was off, but it's still pretty cheap. Again, I've got $300 of complimentary credit. So whatever.
I might not be able to record a demo of the thing working for a couple of days (I realize I'm going to need to do some editing to hide personal details from the screen recording I hope to make). I have a day job. But I hope to share some updates on how I might translate for people with cool JAVs. I'll give details, but with a free tool on PC/MacOS/Linux anyone should be able to get me the material for a translation by sharing an audio-only mkv that takes mere moments to create and will only be like 100 MB to send to me.
I'll figure out a method to share the SRTs I've created thus far.
Update 2 - 2022-09-10
These are all entirely random. And most of them are probably closer to a decade old than not. What can I say? I don't download JAVs as often as I used to and I'm pretty content with the library I built in my early 20s. Lots of incest and kinky stuff.
If someone viewing this happens to potentially have one of these videos, give it a try!
Even if you don't, download a copy anyway.
For any engineers out there. Have at it:
TreeAntSan/trans_video_subs: Auto Translate and add subtitle to video on Google Cloud (github.com)
This is seriously advanced stuff. So... uhhh... not going to troubleshoot your issues, so don't even ask. Wait until I reply again.
Update 3 - 2022-09-10
Haha, holy shit. Never mind about the price prediction. My credit count got updated. It's looking like I'm getting charged about $1 USD an hour. Still cheap. But this evening of rallying through a bunch of JAVs cost $80 USD. Hoo boy. Still pretty worth it.
Gotta say, above all: you're only going to get fun results with JAVs that have roleplay or story. There needs to be a lot of talking. Otherwise you're liable to pay good money for just "Ohh it's so big" or "don't stop"... which isn't fun.
Update 4 - 2022-09-11
More SRT files shared. Now 39 in total (7 more than yesterday).
I've added a slew of bash scripts (scripts you can use in Linux and MacOS) that remove most of the repetitive tasks that'd otherwise require manipulation with MKVToolNix GUI.
Update 5 - 2022-09-11
Oh, and I also discovered a way to cut my Speech-to-Text API costs (the greatest source of cost) down by about 66%. If you let Google perform analysis on your uploaded audio they give you a 33% discount. They also charge per channel, so a stereo audio track doubles the cost. So, I obviously adjusted the tools to mix down all audio to mono. Together that makes the API 66.67% cheaper to use.
From $0.012 USD per 15 seconds to $0.004 USD per 15 seconds. That's equivalent to dropping a 2hr JAV from costing $5.76 USD ((2*60*60/15) * 0.012) to $1.92 USD ((2*60*60/15) * 0.004).
I'll have to trust the price drop will come. I'm only fairly certain I've dropped the price by 66%.
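For anyone who wants to play with the numbers, here's a rough sketch of that Speech-to-Text math in Python. The per-channel rates ($0.006 per 15 seconds normally, $0.004 with the analysis discount) are just backed out of the figures above, and rounding up to whole 15-second blocks is my assumption about how billing works. The function name and parameters are made up for illustration, so check the current pricing page before trusting any of it.

```python
import math

# Rates in thousandths of a dollar per 15-second block, per audio channel.
# Derived from the figures in this post, not from an official price sheet.
MILLS_STANDARD = 6       # $0.006 without the analysis (data logging) discount
MILLS_WITH_LOGGING = 4   # $0.004 with the discount

def speech_to_text_cost(duration_seconds, channels=1, data_logging=True):
    """Estimate the Speech-to-Text cost in USD for one audio file."""
    # Assumption: billing rounds up to whole 15-second blocks.
    blocks = math.ceil(duration_seconds / 15)
    mills = MILLS_WITH_LOGGING if data_logging else MILLS_STANDARD
    return blocks * channels * mills / 1000

two_hours = 2 * 60 * 60
print(speech_to_text_cost(two_hours, channels=2, data_logging=False))  # stereo, no discount
print(speech_to_text_cost(two_hours, channels=1, data_logging=True))   # mono, discounted
```

The two calls are the before and after cases for a 2hr video: stereo without the discount versus mono with it.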
Next comes translation. While each transcript's length varies, I imagine we can expect anywhere from 7,000 to 15,000 characters to be used per video (this is a big guess-timate, I don't currently have any samples of actual counts, really just doing word-counts of English after-the-fact).
Translate API costs come to $20 USD per 1,000,000 characters translated (incoming). It helps that Japanese consumes fewer characters than English. So if a video had 10,000 Japanese characters that'd be a pretty humble $0.20 USD.
So, a two hour video would probably cost $2.12 USD or so to complete. I checked the cost of the Cloud Run instance and it's really quite small. In generating 39 videos' worth of translations, the cost of the computing has only come to $5.69 USD so far, which is tiny.
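If you want to ballpark a video before running it, here's a minimal sketch that adds the two API costs together. It assumes the discounted mono Speech-to-Text rate ($0.004 per 15 seconds) and the $20 per 1,000,000 characters Translate rate quoted above, ignores the Cloud Run compute cost, and uses a hypothetical function name. The character counts are still my guess-timates.

```python
import math

SPEECH_MICRO_USD_PER_BLOCK = 4000   # $0.004 per 15 s, mono with discount (from this post)
TRANSLATE_MICRO_USD_PER_CHAR = 20   # $20 per 1,000,000 characters (from this post)

def estimate_video_cost(duration_seconds, japanese_chars):
    """Rough per-video API cost in USD: transcription plus translation."""
    # Assumption: transcription is billed in whole 15-second blocks.
    blocks = math.ceil(duration_seconds / 15)
    speech = blocks * SPEECH_MICRO_USD_PER_BLOCK
    translate = japanese_chars * TRANSLATE_MICRO_USD_PER_CHAR
    # Work in millionths of a dollar so the sum stays exact, then convert.
    return (speech + translate) / 1_000_000

two_hours = 2 * 60 * 60
for chars in (7_000, 10_000, 15_000):
    print(chars, estimate_video_cost(two_hours, chars))
```

Across the 7,000 to 15,000 character range the total barely moves, since transcription dominates the bill.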