CST RT 040
CST RT 040
This document specifies the technical constraints applicable to the production of HD files for
RFB (Ready for Broadcast) acceptance of programmes delivered to television channels in
France.
For audio and video technical specifications, the scope of this document concerns all programme elements
delivered to Broadcasters: trailers, sponsorship, TV movies, documentaries, series, jingles, advertising, feature
films, short films, live broadcasts, etc.
The requirements of Broadcasters concerning the content and nature of delivery media are not mentioned in this
document, except for identification signals, because it was decided that these should be specifically defined by
each Broadcaster depending on their own requirements and the contractual framework specific to each channel
and each programme.
The amendment of these recommendations (by increments to the version number) takes place according to the
type and nature of the modifications made to this document. Considering that the version number is in the form
A.B:
A will change in increments of 1 in the event of the addition of major functional modifications affecting the
product delivered
B will change in increments of 1 in the event of corrections, additional specifications etc.
Any amendments to this document must be validated by all the parties who took part in drafting this version.
Paragraph titles
References to other standard documents or recommendations whose content must necessarily be observed
within the framework of this document
Elements to which particular attention must be paid by all parties for various reasons (e.g. new elements in
the standard entailing changes to customary work procedures).
1.4.3 - Audio
ITU-R BS.1770-4: Leq RLB audio measurement algorithms
ITU-R BS.775 : Layout of 5.1 speakers
UER-R91-1998 : Allocation of channels for 5.1 audio
EBU-R128 and R128s1: Loudness normalisation and permitted maximum level of audio signal
EBU-Tech 3341, 3342, 3343, 3344 : technical recommendations relating to EBU R128 and R128s1
EBU Tech 3276 and 3276s1 : Listening system calibration
EBU Tech 3304 : 5.1 audio test signals
SMPTE 299M : 24-byte audio for HD TV
IEC 60268-5 : Audio listening
16/9 Video
Image format First line End line Number of lines
1,33 21 560 540
1,37 21 560 540
1,66 21 560 540
1,77 21 560 540
1,85 32* 550* 519*
2,35 87* 495* 409*
2,39 90* 492* 403*
This value may be modified if the owners of the rights validate a request for specific re-framing.
Horizontally, the SAV (Start of Active Video) must therefore be immediately followed by useful content. A gap of
0.1 µs between the SAV and the start of the useful content is tolerated. For a given programme, this gap – if there
is one – will be constant for the duration of the programme.
In accordance with recommendation EBU R 103 (v2.0 June 2016) and in order to guarantee that the color gamut
is respected, we recommend that the digital values of the RGB components, as well as the Y value of
corresponding luminance should, necessarily lie between the minimum and maximum values, i.e. 5-246 in 8 bytes
(see table below).
The proportion of pixels whose values are outside the Color Gamut (see above table) must not exceed 1% of the
total number of pixels in the active image.
As a general rule, the image must never have an excessive sound level, shiny defects, compression
artefacts or faults due to the over-use of a noise-reduction process etc.
The darkest scenes in the programme must - in all cases – be viewable in an acceptable quality when
they are screened on a “general public” television. Black areas must not be stuck or squashed and
details must remain visible in dark areas.
Special care must be taken with quantification conversions, especially for films made using typical
« cinema » systems (codecs 10 to 16 bytes) during the conformation of the 8-byte video signal
intended for the Ready For Broadcast file.
The pre-requisites used for drafting this document with regard to the constraints and aims of sound
broadcast on television are as follows :
Respecting the work, in particular concerning artistic choices
Guaranteeing comfortable listening for viewers by means of perfectly intelligible sound in the various
configurations (all multi-channel types including 5.1, down-mix, stereo, mono)
Guaranteeing continuity of perception of sound in the sequence of different programmes
Optimising the use of metadata inherent to Dolby Digital and Dolby Digital Plus coding.
GENERAL CONSIDERATIONS
Any work done to make mixed sound tracks comply with the clauses of this recommendation must be carried
out under the authority of the owners of rights or by a service provider appointed or empowered by them. The
work done must be considered satisfactory by everyone involved (producer, post-producers, broadcasters).
The use of solutions that include the dynamic compression of the signal must not result in constant energy
levels without any “aeration” of the sound signal over time.
UNITS
LUFS means Loudness Unit on the Full Scale. A LUFS value is the expression of a weighted measurement, K
(Leq(R2LB)) on a Full Scale.
The LUFS scale is graduated in steps of 1 dB. Applying an increase of 1 dB to the measured signal results in an
increase of + 1 dB in the reading on the LUFS scale.
These sound tracks must be mixed in an auditorium where the technical equipment and dimensions are suitable
for television sound (close-up listening). For example, large cinema-type auditoria are not suitable for this kind
of mixing and they are therefore not advised.
The definition of the sound level for the “validation” of sound levels in RFB programmes in a quality check room
can be found in the clauses of recommendations EBU-Tech 3276 and 3276 s1.
The mixer may adjust the sound level (SPL acoustic pressure level) validated with reference pink noise (EBU
Technical Recommendation R68) depending on the proposed hall and sound system, in the region of 79 dB (C)
per channel (not including LFE).
Audio mixing, mastering, encoding and quality control devices must be fitted with speakers that comply with
standard IEC 60268-5.
The position of the speakers must comply with standard ITU-R BS.775.
Sound devices must be calibrated according to the rules laid down in recommendation EBU Tech 3276s1.
Programmes will be checked on sound systems whose line-up level - for pink noise broadcast at -18dBFS - is
74 dB (C) SPL on every channel except the LFE channel, which is broadcasted at 78 dB (C) SPL.
This level is considered to be the one at which the overall dynamic range and the intelligibility of speech and
timbre are deemed to comply with television broadcast levels.
PEAK LEVELS :
Levels – expressed in dBTP – are checked by means of an instant peak-meter, « True Peak ».
Current recommendations:
SMPTE 299M : ”24-Bit Digital Audio Format for SMPTE 292 M Bit-Serial Interfaces”.
ITU-R BS.1770-4
LOUDNESS
History
Further to the work carried out since 2000 within the framework of FICAM / CST work groups on listening
comfort and sound level, completed by the work done by the CST’s Sound department and the HD Forum ; we
propose a method for measuring the energy level – or loudness - which allows us to validate the value of this
level for all programme types intended for television broadcast, whatever format they are in (PCM, Dolby E,
etc.) and whatever the type of mixing (multi-channel, stereo, mono, etc.)
This value will also allow us to inform any metadata if encoding is used.
Metering method
The metering method we use consists of measuring the whole of the useful programme using a device that
implements the algorithm described in recommendation R 128 published by EBU, as well as its appendices
3341, 3342, 3343 and 3344, producing a value given in dB LUFS.
The line-up level reading on a Full Scale digital peak meter is -18dB FS for a sinusoidal signal at a frequency
of 1000Hz.
A 1000 Hz line-up signal at -18 dBFS should display a loudness level of -18 LUFS on a metering device in
EBU mode, if the signal is present on the left and right channels of a stereo programme.
If the signal is present on one channel only, the displayed level will be -21 LUFS.
As a reminder, in analogue sound: the line-up level reading on PPM quasi-peak 10 ms DIN 45406 is -9 dB and
the line-up level reading is 0 Vu on a Vu meter.
The peak level for audio signals measured using True Peak, must not exceed -3,0 dB TP.
For each version of the sound mixing (French, Multi-lingual, audio-description…) the following table
summarises the sound recommendations for suppilers.
Short term loudness range of Not measured -30,0 ≤ Dialog short term ≤ -16,0
dialogues around the target value in LUFS
of the integrated loudness
In order to maintain sufficient intelligilbility for television viewers and to take into account the considerable
variations in their listening conditions, measuring the short-term loudness in sections of the programmes that
include dialogues must target an average level of -23.0 LUFS and a maximum variation of 7 LU either side of
the target level is permitted ; -30.0 LU is equivalent to a whisper and -16.0 LU is equivalent to a shout.
Respecting the target levels for dialogues will allow the indiscriminate linear and non-linear (catch-up TV)
broadcast of programmes, a multi-screen complex leading to an increasing number of listening situations by
means of headphones, miniature speakers such as those on personal computers, tablets and mobile
telephones.
The following diagram fixes the main points of reference on the tools that measure loudness (left side) and the
maximum peak level (right side).
If a justified and very specific request is made in the comments section of the programme’s XML form, short
programmes may present an integrated loudness measurement below -23.0 LUFS.
+ 7 LU
Excursion of loudness
MAX
+ 3 LU
- 22,0 LUFS
- 23,0 LUFS * Maximum value
- 23,0 LUFS - 23,0 LUFS
+ 20
Average value - 24,0 LUFS Average value
of of
"loudness level" "loudness level"
- 7 LU
Fig 2A : Dynamic profile value Fig 2B : Dynamic profile value in "short term" Fig 2C : Peaks values Fig 2D : LRA values
Video and audio signals are synchronous with the following tolerance:
By default, tracks encoded using Dolby E must be recorded synchronously with video (In-sync).
This will result in a delay of 1 (one) audio image compared to the video, further to decoding using Dolby E
without compensating for the video delay.
Format conversions using speed variations (24/25 fps) generally require harmonisation in order to remain within
the original musical tone. Harmonisation procedures on 5.1 signals are at the limits of current technology and in
Page 10 sur 19 CST - RT 040 - TV - RfB Files v1.2 - 2016.docx
CST – 22-24, Avenue de St Ouen - 75018 PARIS - Tél. : 01 53 04 44 07 - Fax : 01 53 04 44 10 - E-mail : [email protected] - URL : http://www.cst.fr
Association loi 1901 - SIRET 78433300700036 - Code APE 743 B - Code TVA FR96784333007
some cases they are terribly destructive. Harmonisation is therefore a delicate operation that must be listened to
again and validated by a qualified person.
When the verification is carried out, it will be listened on stereo and on a multi-canal system (if necessary using
Dolby Surround decoding or Dolby E decoding). The compatibility of 5.1 multicanal programmes with stereo and
mono sound must necessarily be validated by simulating the RF film standard mode, at the very least with
verification by means of sampling for long programmes and in their entirety for short programmes.
IMPORTANT INFORMATION: During broadcast, it often occurs that the French version - when it is
available as multi-channel sound – is the channel’s main audio component. For TV viewers who are not
equipped with a 5.1 installation, this is the component that will be « downmixed » by the receiver to be
listened to in stereo or mono.
2.2.5.1 - Mono
In monophonic broadcasts, the signal recorded using PCM is delivered on two contiguous audio tracks without
digital bit rate compression. In order for this to be compatible with stereo signals, the two tracks containing the
monophonic signal are strictly identical and in time.
Assignment of tracks
In stereophonic broadcasts, the signal recorded using PCM is delivered on two contiguous audio tracks
without digital bit rate compression. The odd track is the left channel and the even track is the right channel.
Audio Phase
This is the average result between the left and right channel of an intensity and phase stereo. To ensure the
mono downmix is compatible, the phase must be mostly positive.
Programme available only with monophonic or stereophonic sound: Dolby-E coding is carried out in
2/0 configuration where the PCM version’s monophonic sound is duplicated identically respectively on
tracks 1 and 2 of the Dolby-E, and the PCM version’s stereophonic sound from the PCM is duplicated
identically track for track on tracks 1 and 2 of the Dolby-E.
Programmes available on 5 or 6 tracks mixed respectively in formats 5.0 or 5.1: the Dolby E coding is
carried out in 3/2 configuration depending on the assignment of tracks in recommendated EBU-R91-
1998,
The specifications regarding the content of tracks 7 and 8 of the Dolby E are left at the discretion of
the Broadcasters.
AES pair number Dolby E flow channel 2+2 configuration 5.1+2 configuration
(encoder entries, decoder main mix in dual-mono or main mix in
exits) stereo 5.1 / 5.0
The characteristics of the stereo signal phase derived from a downmix must fulfil the same requirements as a
traditional stereo signal (Cf. § 2.5.2).
Special case
For certain older programmes encoded in Dolby E simply after adjusting their Integrated Loudness value to -23
LUFS, it often occurs that the average level of dialogues differs greatly from this value.
In this case and in order to maintain the sound quality of dialogue phases of the programme, the sound
intensity measurement should be entered into the Dialogue Level metadata, as measured by equipment using
the Dialogue Intelligence algorithm. In such a case, this decision will be explained in the comments section of
the programme’s XML form.
NB : The Extended BSI mode must necessarily be switched on for 5.1 et 5.0 programmes. However, it
must not be switched on for stereo formats.
* Surround Phase Shift and Preferred Downmix parameters are inter-dependent. Please refer to paragraph 2.2.6.4 for their parameters.
** The Programme Name entered must not contain any characters with accents or spaces.
Figure 2: Table of Dolby-E metadata default parameters for 5.1, 5.0, 2.0, dual and mono formats
ABCDE Fixed metadata dependent solely on the « Channel mode » parameter
ABCDE Default metadata that may be modified if justified (cf. Notes)
ABCDE Metadata dependent on the mixing
3.1 - GENERALITIES
A programme delivered in the shape of a file comprises the following, whatever its length:
A single MXF Op1a file containing the video essentials and the associated audio essentials
The start of the programme (first image and first useful sound) must necessarily begin at a whole hour Time Code
(eg.. 00:00:00:00, 01:00:00:00, 10:00:00:00). The presence of lead-in elements before the first image, or after the
last useful image of the programme (black, neutral backgrounds) remain at the broadcaster’s discretion.
Delivery identifier
Programme title (and if appropriate, the sub-title or the episode number)
The duration (HH:MM:SS:II)
The Magnetic Recording number or Case number or Identifier number
This information must also be reproduced on a paper form accompanying the medium, along with the following
extra information:
Any comments that may be useful for the verification: possible image or sound defects ascertained,
occasional problems with image/sound synchronisation etc.
The various Time Code elements that an MXF file may contain must be coherent amoung themselves and must be
present in the “Material Package”, “Source Package” and the "System Item" in the MXF file.
MXF Ready to Broadcast files will have 8 audio tracks and each different track will be transported by a different
AES3 track.
Each audio track in the MXF file will carry an AES3 stream, with 24 byte audio resolution at 48 kHz sample
frequency. Two AES3 tracks / stream may – if required – transport a Dolby E encloded stream. The assignment of
the file’s 8 tracks, as well as any tracks that may be Dolby E encoded, must be made clear in the associated XML
file as per each Broadcaster’s recommendations.
Any programme that is to be broadcast with sub-titles must be accompanied by one or more sub-title files that
comply with UER standard N19-2002 (STL file), that respects the following instructions: The names of the STL files
will be indicated by the broadcaster
Sub-titles must be synchronous with the Time Code in the MXF file delivered.
The exit Time Code of the last sub-title in the programme must be positioned 3 seconds, at the latest,
before the last useful image of the programme to avoid the last sub-title getting stuck on the broadcast of
the following programme.
The general information in the « blocGSI » in the STL file (programme name, Time Code at the start of the
programme etc.) will be completed. The table below supplies extra information; the Broadcaster reserves
the right to modify the parameters at his discretion:
Teletext Mode
Programme title (OPT) To be completed
Version French Deaf and hard-of-hearing
Page code (CPN) 850
Time Code Format (TC) 25 i/s (STL25.01)
Display standard (DSC) Teletext – Presentation of level 1 (1)
Character code (CCT) Latin (00)
Language (LC) French (0F)
Time Code at the start of the
Complies with time code at start of MXF file delivered
programme (TCP)
Size of characters Double height – single width
Maximum number of characters
37
per line
Colour of sub-titles White As per CSA* chart
Number of rows (MNR) 23 23
Position of sub-titles Maximum 2 lines of text
As per CSA* chart
Horizontal centred
* http://www.csa.fr/content/download/20043/334122/file/Chartesoustitrage122011.pdf
Broadcasters may provide an example of an XML file along with a « MetaPADTV V1-x » Microsoft Excel file
containing pre-defined information zones to complete the various broadcasting and technical fields that describe
the programme
The « MetaPADTV V1-x » Excel file will include macros that allow the XML file to be generated.
The Excel file macros usually automatically generate the name of the MXF file using the data entered when the
XML is generated.
For a multi-party programme, the entrance/exit time codes for each section must necessarily be entered in the XML
file.
ACRONYM DEFINITION
CRC Cycling Redundance Code : Redundancy code to improve the sturdiness of digital information.
K ou R2LB Weighting curve derived from the B curve applied to an audio source within the framework of a
loudness measurement.
LU ou LUFS Loudness Unit or Loudness Unit relative to the Full Scale : unit for measuring loudness as defined in
recommendation EBU R128
MXF Material eXchange Format : file format devoted for the exchange of professional media
PCM Pulse Code Modulation: referenced as the digital representation of an analogue signal without any
flow reduction or compression algorithm.
PPM Peak Program Meter : type of peak-meter used for measuring an audio signal
SDI Serial Digital Interface : standard for transmitting an uncompressed digital television signal (audio,
video, ancillary data)
VANC Vertical ANCillary data space : digital information not integrated into the digital image
ACRONYM DEFINITION
AES Audio Engineering Society : international organisation of sound engineers, producing standards in
the field of sound
AMWA Advanced Media Workflow Association : organisation that specifies uses and recommendations for
dematerialised production
CSA Conseil Supérieur de l’Audiovisuel : the body that regulates television channels in France
CST Commission Supérieure Technique de l’Image et du Son : Association of Technicians from the
fields of sound and image in cinema and television.
EBU European Broadcast Union (Union Européenne de Radio Télévision), European organisation that
brings together public television and radio channels
HDFORUM French association that promotes improvements to the quality of television services. It comprises
three groups : broadcasters, industrial entities and distributors
IEC International Electrotechnical Commission : organisation that produces standards for the electric
and electronic industry
SMPTE Society of Motion Picture and Television Engineers : American association of television and cinema
engineers