Skip to main content

Merge audio and video files

In individual recording mode, Agora Cloud Recording generates one audio and/or video file for each user ID. If you want to merge each user ID's audio and video files into one file, you can use Agora Format Converter Script.

Prerequisites

Recommended systems for transcoding:

  • Recommended systems for transcoding:
    • Ubuntu 14.04+ x64
    • CentOS 7.0+ x64
    • Debian 8.0 +
  • Python 2, version 2.7 or later (Except Python 3).

File preparation

  • Use Agora Cloud Recording to generate the audio and/or video files. The transcoding will fail if any of the recorded files is missing.
  • Ensure that the recorded files are stored in an accessible directory.

Transcoding steps

1. Get the Agora Format Converter Script

Download the Agora Format Converter Script and decompress it. Find ffmpeg.tar.gz and convert.py. Decompress ffmpeg.tar.gz, and make sure it is in the same directory as convert.py.

To avoid compatibility issues, Agora suggests that you use the FFmpeg binary in the zipped file or install FFmpeg 3.3.

2. Execute the Agora Format Converter Script

You can start transcoding the files after you set the parameters.

Transcoding options

You can get the options with the python convert.py command.

Or you can refer to the following table to learn about the transcoding options:

OptionDescriptionRequired or not
-fSpecify the directory of the recorded files to be transcoded. If the directory contains files generated during several recording instances, the Agora Format Converter Script transcodes all the files by the order of the recording instances.YES
-mSet the transcoding mode: See -m options for details.0: (Default) Merge the audio and video files of the same user ID based on the recording segment.1: Merge the audio and video files of the same user ID into one file.2: Merge the audio files of the same user ID into one audio file.3: Merge the video files of the same user ID into one video file (no audio).NO
-sSet whether or not to include the period when a user is absent from a channel. If you do not set -s, which is the default mode, then if a user leaves the channel and rejoins it later, the period when the user is absent is included and displayed as the last frame before the user leaves. For example, if a user is in the channel for 2 minutes, then leaves the channel for 30 minutes, and rejoins the channel for another 2 minutes for a total recording time of 34 minutes, the recording will display the last frame before the user leaves for 30 minutes. If you set -s, then if a user leaves the channel and rejoins it later, the recording will not include the period when the user is absent. Do not set this parameter when -m is 0.NO
-pSet the frame rate of the transcoded video. The default rate is 15 fps. The value range is from 5 fps to 120 fps. If -p is less than 5 fps, the video will be transcoded with the frame rate of 5 fps. For more details, see How do I set the video profile of the recorded video?NO
-rSet the resolution of the transcoded video in the format of “width height”, for example, -r 640 360. For more details, see How do I set the video profile of the recorded video?NO
-aWhile transcoding audio and video, generate the user's audio file, the file name format is: user ID_timestamp.m4a.NO
-cSet the screenshot interval (millisecond), the generated file name format: user ID_timestamp_index.jpg.NO
-uSpecify the user ID to be transcoded; if not specified, all user IDs in the folder are transcoded.NO
-bSpecify the transcoding start timestamp. The unit is seconds and supports floating point numbers. For example -b 10.500 means starting from 10 seconds and 500 milliseconds.NO
-tSpecify the length of time to be intercepted. This option must used with -b ; it cannot be used alone. For example, -b 10 -t 5 means to intercept 5 seconds of data from the 10th second.NO
-oSpecify the output file name, and only generate the specified file name when -u is valid; if not specified, generate the default file name.NO
-eTranscode the recorded ts file into a standard format ts file, and output the MP4/M4A file with the same name.NO

-m options

Before introducing the -m options, we'll first explain the concept of a recording segment. A recording segment starts when Agora Cloud Recording starts recording a user ID and ends when Agora Cloud Recording stops recording a user ID.

Agora Cloud Recording starts recording a user ID only when all the following conditions are met:

  • The user ID joins the channel and starts sending audio or video streams.
  • You start Agora Cloud Recording

Agora Cloud Recording stops recording a user ID as long as one of the following conditions is met:

  • The user ID stops sending streams and does not send streams again within 15 seconds
  • The user ID leaves the channel and does not rejoin within 15 seconds
  • You stop Agora Cloud Recording

See Example for more information about the recording segment.

Now, let's see the behaviors of different -m options:

It is assumed that each user ID in the channel generates several audio files and video files.

  • 0: Merge the audio and video files of the same user ID based on the recording segment. One recording segment of one user ID corresponds to one merged file, named as user ID_timestamp_av.mp4. In the filename, timestamp is the time when Agora Cloud Recording starts recording. If Agora Cloud Recording starts to record the audio and video at different times, timestamp is the earlier one of the two starting times. The time zone of timestamp is UTC+0. The timestamp parameter consists of the year, month, day, hour, minute, second, and millisecond. For example, 100_20190611073246073_av.mp4 is a merged file for user ID 100 and the start time of the recording is 07:32:46.073 a.m., June 11, 2019.

  • 1: Merge the audio and video files of the same user ID into one file. One user ID corresponds to one file, named as user ID_0_merge_av.mp4.

  • 2: Merge the audio files of the same user ID into one audio file. One user ID corresponds to one file, named as user ID_0_merge.m4a. The file uses AAC encoding. You can use the Agora Format Converter Script to transcode the file to other formats, such as MP3, see Convert steps for details.

  • 3: Merge the video files of the same user ID into one video file (no audio). One user ID corresponds to one file, named as user ID_0_merge.mp4.

  • If the transcoding mode is -m 0, and the -b parameter and -t parameter are used, the -m parameter is forced to 1, and the slice transcoding configuration is invalid.

  • If the -b parameter and -t parameter are used, the time period is added to the file name after transcoding. For example: The file name of python convert.py -f <path of the file to be transcoded> -m 1 -b 10.5 -t 5.5 is convertered to 123_0_merge_10.500_5.500_av.mp4 after transcoding.

  • Switching between different AudioProfile configuration at the streaming sender during a recording is not supported.

Example

The following examples show different merged files with different -m settings.

Suppose that two users, with user IDs 100 and 125, stay in the channel, and you start Agora Cloud Recording in individual recording mode. user ID 100 leaves the channel and rejoins the channel in 30 seconds, thus causing two segments in this recording instance. When the recording ends, user ID 100 has four recorded files: one audio file and one video file for each of the two segments. user ID 123 has one audio file and one video file.

To merge the audio and video files of the same user ID into one file, use the following command:


_1
python convert.py -f <directory of the files to be transcoded> -m 1 -s -p 30 -r 640 360

Two files are generated after transcoding: 100_0_merge_av.mp4 and 123_0_merge_av.mp4. The 30 seconds when user ID 100 is absent is not included in the merged file. To include the time interval, do not set -s.

To merge the audio and video files of the same user ID based on the recording segment, use the following command:


_1
python convert.py -f <directory of the files to be transcoded> -m 0 -p 30 -r 640 360

Three files are generated after transcoding: 100_timestamp1_av.mp4, 100_timestamp2_av.mp4, and 123_timestamp1_av.mp4.

A convert-done.txt file is generated after the transcoding is complete. Once the Agora Format Converter Script is used, a convert.log file is generated in the same directory as the audio and video files upon completion of the transcoding.

The transcoded MP4 file supports most mainstream media players. See the following table for details:

Operating SystemMedia Players
Linux
  • VLC
  • ffplayer
  • Windows
  • Media Player
  • KM Player
  • VLC Player
  • Chrome (49.0.2623 or later)
  • macOS
  • QuickTime Player
  • Movist
  • MPlayerX
  • Chrome (49.0.2623 or later)
  • Safari (11.0.3 or later)
  • iOS
  • iOS default player
  • KMPlayer
  • Safari (9.0 or later)
  • Android
  • Android default player
  • MXPlayer
  • VLC
  • KMPlayer
  • Chrome (9.0.2623 or later)