Preprocess Module

audiovisually.preprocess.mp3_to_text(audio_path, api_key=None, engine='assemblyai', whisper_model='base', timestamps=False)

Transcribe an audio file to text using AssemblyAI or OpenAI Whisper.

Parameters:
  • audio_path (str) – Path to the audio file.

  • api_key (str, optional) – AssemblyAI API key (required if engine=’assemblyai’).

  • engine (str) – ‘assemblyai’ or ‘whisper’ (default ‘assemblyai’).

  • whisper_model (str) – Whisper model size (default ‘base’).

  • timestamps (bool) – If True and engine=’whisper’, return timestamps.

Returns:

DataFrame with sentences (and optionally timestamps) or error message if transcription fails.

Return type:

pd.DataFrame or str

Example

>>> from audiovisually.preprocess import mp3_to_text
>>> df = mp3_to_text("audio.mp3", api_key="your_api_key", engine="assemblyai")
>>> df = mp3_to_text("audio.mp3", engine="whisper", timestamps=True)
audiovisually.preprocess.translate_data(data, source_lang='auto', dest_lang='en', text_column='Sentence', translated_column='Translation')

Translate text in a DataFrame column, a list of strings, or a single string to a target language.

Parameters:
  • data (pd.DataFrame, list, or str) – DataFrame, list of strings, or single string to translate.

  • source_lang (str) – Source language code (default ‘auto’).

  • dest_lang (str) – Destination language code (default ‘en’).

  • text_column (str) – Name of the column with text to translate (used if data is DataFrame).

  • translated_column (str) – Name of the column to store translations (used if data is DataFrame).

Returns:

DataFrame with translations, translated string, or list of translations.

Return type:

pd.DataFrame, str, or list

Example

>>> from audiovisually.preprocess import translate_data
>>> df = pd.DataFrame({"Sentence": ["Hola", "Bonjour"]})
>>> translated_df = translate_data(df, dest_lang='en')
>>> translated_str = translate_data("Guten Morgen", dest_lang='en')
>>> translated_list = translate_data(["Ciao", "Hallo"], dest_lang='en')
audiovisually.preprocess.video_to_mp3(video_path, output_path=None)

Convert a video file to MP3 audio.

Parameters:
  • video_path (str) – Path to the input video file.

  • output_path (str, optional) – Path to save the output MP3 file. If None, saves in the same folder as input.

Returns:

Path to the generated MP3 file, or error message if conversion fails.

Return type:

str

Example

>>> from audiovisually.preprocess import video_to_mp3
>>> mp3_path = video_to_mp3("input.mp4")