Good job picking up the dialogue over music -- I think it has done a much better job in that condition than Autosub.
As a comprison I ran autosub on the same movie. Here is the ja-JP srt result.
I also ran the NAVER translator on both versions: your aws version and the autosub -just to compare. It looks to me that autosub does a slightly better job in quiet or hiss noise. But AWS does much better job in noisy situatuion.
As a curiosity I also ran pyTranscriber on a portion of the video. I kind of thought pyTranscriber did a better job than autosub though it is supposed to be using autosub as its core engine