Broken: starting a new speech-to-text job fails
Wrong language code ('French' is passed where a code such as 'fr' is expected):
ffmpeg version 4.3 Copyright (c) 2000-2020 the FFmpeg developers
  built with gcc 7.3.0 (crosstool-NG 1.23.0.449-a04d0)
  configuration: --prefix=/opt/conda --cc=/opt/conda/conda-bld/ffmpeg_1597178665428/_build_env/bin/x86_64-conda_cos6-linux-gnu-cc --disable-doc --disable-openssl --enable-avresample --enable-gnutls --enable-hardcoded-tables --enable-libfreetype --enable-libopenh264 --enable-pic --enable-pthreads --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libmp3lame
  libavutil 56. 51.100 / 56. 51.100
  libavcodec 58. 91.100 / 58. 91.100
  libavformat 58. 45.100 / 58. 45.100
  libavdevice 58. 10.100 / 58. 10.100
  libavfilter 7. 85.100 / 7. 85.100
  libavresample 4. 0. 0 / 4. 0. 0
  libswscale 5. 7.100 / 5. 7.100
  libswresample 3. 7.100 / 3. 7.100
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'http://10.14.14.86:9000/vortanz-beta/4d70c1ed-a2f0-466e-8a0a-4d68b097d5c6/22619f531f09556ca44344a6b545cb51-standard.mp4?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=minio%2F20231116%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20231116T145826Z&X-Amz-Expires=86400&X-Amz-SignedHeaders=host&X-Amz-Signature=10dcb1ac6a371021e561183d81e3cc97513ca911d7e842f21379b48c71dce487':
  Metadata:
    major_brand : isom
    minor_version : 512
    compatible_brands: isomiso2avc1mp41
    encoder : Lavf59.27.100
    location-eng : +50.9336+006.8693/
    location : +50.9336+006.8693/
  Duration: 00:00:17.12, start: 0.000000, bitrate: 2789 kb/s
    Stream #0:0(eng): Video: h264 (Constrained Baseline) (avc1 / 0x31637661), yuv420p(tv, bt709), 1600x720, 2654 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)
    Metadata:
      handler_name : VideoHandle
      encoder : Lavc59.37.100 libx264
    Stream #0:1(eng): Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, stereo, fltp, 129 kb/s (default)
    Metadata:
      handler_name : SoundHandle
Stream mapping:
  Stream #0:1 -> #0:0 (aac (native) -> pcm_s16le (native))
Press [q] to stop, [?] for help
Output #0, wav, to '/workspace/22619f531f09556ca44344a6b545cb51-standard.wav':
  Metadata:
    major_brand : isom
    minor_version : 512
    compatible_brands: isomiso2avc1mp41
    location : +50.9336+006.8693/
    location-eng : +50.9336+006.8693/
    ISFT : Lavf58.45.100
    Stream #0:0(eng): Audio: pcm_s16le ([1][0][0][0] / 0x0001), 44100 Hz, stereo, s16, 1411 kb/s (default)
    Metadata:
      handler_name : SoundHandle
      encoder : Lavc58.91.100 pcm_s16le
size= 2940kB time=00:00:17.10 bitrate=1408.0kbits/s speed= 306x
video:0kB audio:2940kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.002591%
Traceback (most recent call last):
  File "/workspace/SpeechWhisper.py", line 353, in <module>
    main()
  File "/workspace/SpeechWhisper.py", line 348, in main
    result = sw.predict(args)
  File "/workspace/shared/model_wrapper_util/metrics.py", line 155, in __call__
    result = self.func(self.instance_, *args, **kwargs)
  File "/workspace/SpeechWhisper.py", line 292, in predict
    segments, info = self.model.transcribe(v['audio'],
  File "/opt/conda/lib/python3.8/site-packages/faster_whisper/transcribe.py", line 332, in transcribe
    tokenizer = Tokenizer(
  File "/opt/conda/lib/python3.8/site-packages/faster_whisper/tokenizer.py", line 29, in __init__
    raise ValueError(
ValueError: 'French' is not a valid language code (accepted language codes: af, am, ar, as, az, ba, be, bg, bn, bo, br, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fo, fr, gl, gu, ha, haw, he, hi, hr, ht, hu, hy, id, is, it, ja, jw, ka, kk, km, kn, ko, la, lb, ln, lo, lt, lv, mg, mi, mk, ml, mn, mr, ms, mt, my, ne, nl, nn, no, oc, pa, pl, ps, pt, ro, ru, sa, sd, si, sk, sl, sn, so, sq, sr, su, sv, sw, ta, te, tg, th, tk, tl, tr, tt, uk, ur, uz, vi, yi, yo, zh)
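
The traceback shows the tokenizer rejecting the language name 'French' where it expects a code such as 'fr'. One possible workaround, sketched here under assumptions, is to normalize the value before it reaches `transcribe`. The helper `normalize_language` and the `LANGUAGE_NAMES` table are hypothetical and not part of SpeechWhisper.py or faster_whisper; the set of accepted codes is copied from the error message above, and the name table is deliberately incomplete.

```python
# Accepted codes, copied from the ValueError message in the traceback.
ACCEPTED_CODES = frozenset(
    "af am ar as az ba be bg bn bo br bs ca cs cy da de el en es et eu fa fi "
    "fo fr gl gu ha haw he hi hr ht hu hy id is it ja jw ka kk km kn ko la lb "
    "ln lo lt lv mg mi mk ml mn mr ms mt my ne nl nn no oc pa pl ps pt ro ru "
    "sa sd si sk sl sn so sq sr su sv sw ta te tg th tk tl tr tt uk ur uz vi "
    "yi yo zh".split()
)

# Hypothetical, illustrative name-to-code table; extend it for the names
# the calling service actually sends.
LANGUAGE_NAMES = {
    "english": "en",
    "french": "fr",
    "german": "de",
    "spanish": "es",
    "italian": "it",
}


def normalize_language(value):
    """Map a language name or code to an accepted code.

    None is passed through unchanged so the caller can still leave the
    language unset.
    """
    if value is None:
        return None
    key = value.strip().lower()
    if key in ACCEPTED_CODES:
        return key
    if key in LANGUAGE_NAMES:
        return LANGUAGE_NAMES[key]
    raise ValueError(f"unsupported language: {value!r}")


if __name__ == "__main__":
    print(normalize_language("French"))
```

If this approach fits, `predict` could call the helper on the incoming language value and pass the result as the `language` argument of `self.model.transcribe`. This assumes the language reaches `predict` as a plain string, which the log does not confirm.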