Archive

Archive for the ‘Event’ Category

‘An introduction to forensic audio’

Title: ‘An introduction to forensic audio’
Location: Royal Academy of Engineering, London
Description: Lecture by Gordon Reid, CEDAR Audio Ltd
Start Time: 18:30 for 19:00
Date: Thursday 15th April 2010

Speech enhancement has come a long way in the digital era, but it is not the ‘magic wand’ depicted on TV and in Hollywood movies. Adaptive filters have traditionally been the basis of forensic audio work, but a combination of techniques – including broadband noise reduction, buzz removal, equalisation and background noise suppression – can provide superior results when compared with any single approach. This introduction, illustrated using examples processed in real-time on a CEDAR Cambridge Forensic system, aims to shed light on this, demonstrating how signal processing can aid investigators in areas including criminal investigation, counter-terrorism and air accident investigation.

Lecture Report

Note: we are unable to provide a recording of this lecture because some of CEDAR’s police and security customers place strict constraints on the public dissemination of the audio clips and details of cases used in demonstrations of CEDAR’s forensic technology.

Gordon Reid is the Managing Director of CEDAR Audio, a leading manufacturer of audio restoration and speech enhancement products. He kicked off his lecture with a scenario of how video surveillance, without audio content, can give ambiguous or even completely misleading indications of intent.

Audio forensics is a relatively new field that first entered common use in the 1960s/1970s. Thanks to the technology of companies like CEDAR, audio forensics is now an established field, and the most recent trend is for audio and video surveillance data to be integrated. Before the arrival of digital technology in the 1990s, audio forensics was relatively crude, using often poorly-maintained analogue tape recorders, no single-ended noise reduction, and often just analogue EQ and dynamics processes for clean-up.

Nowadays, recordings are mostly digital, and can be made using low-cost consumer equipment. But this brings some new problems. Recordings are often made by untrained people using small, cheap recorders: he highlighted a divorce litigation case in which a woman concealed a recorder at the bottom of her handbag, covered by a scarf and jumper to make sure it wasn’t found. Unsurprisingly, there was almost no discernable speech data on the resulting recording. So there are new problems to face, but fortunately, DSP algorithms and powerful computers can help get around many of these. But even these have limits: Gordon described a phenomenon known as the “CSI Effect”, whereby the public has unrealistic and fantasy-based expectations of surveillance restoration technology. He cited the apparently genuine example of a person who’d snapped a photo of the side of a speeding getaway vehicle on a mobile phone, and handed it to the police in the expectation that by rotating the side-on (and blurred, low-quality) image in a 3-D computer imaging system, they could read the license plate! But absurd cases aside, there is an increasing problem: the bad guys are increasingly aware of surveillance techniques, making (for example) body wires impractical because criminals know how to frisk for them effectively. They also know to hold sensitive conversations in locations where there is loud, effective masking noise such as running water or TV noise.

Gordon broke noise reduction technologies for audio forensics into two main applications: real-time surveillance and non-real-time laboratory investigation. Surveillance systems have live listeners (typically police or security officers), who may need to make fast, accurate and life-critical decisions based on what is heard. The principal requirements are low latency, high intelligibility and low listener fatigue. Non-real-time systems are typically used to produce evidence admissible for the courts, so the requirements are for high transcription accuracy, the retrieval of otherwise unintelligible speech, and to reduce transcriptor fatigue. Also, jurors are not trained listeners and courtrooms typically have very poor acoustics, so the presence of background noise may affect their judgement. He cited the case of a defence lawyer who used the presence of modest traffic and street noise on an intelligible recording of incriminating statements to cast doubt on the transcription of the recorded speech – and won.

Gordon listed the long-established principles of good non-covert audio evidence: a suitable recorder, competent operator, authentic recordings, recordings preserved such that they are demonstrable in court, speakers identified, evidence made voluntarily and in good faith – and no edits or changes made. The last point is potentially problematic as, in principle, it could exclude the enhancement processes that render noisy evidence intelligible. This is a grey area, with the degree of processing admissible dependent on the judge, court and jurisdiction. Clearly, there is a need to demonstrate that the processing has not modified the meaning of the evidence. For example, it’s not possible for the microscopic editing of a real-time declicking algorithm to change phonemes, and so change the meaning, but the court may need to be convinced of this. Additionally, proposed UK government regulations on handling evidence may be applied to audio evidence, potentially causing substantial problems when regulations designed to protect physical items are applied to digital media.

Gordon moved on to talk about the specifics of the technology used: it’s usually some combination of noise reduction, equalisation and level processing (e.g. dynamics processing). Dialogue noise suppression is a technology originally developed for the film industry, and CEDAR’s first product in this field, a real-time and very easy to use device, was aimed at post-production for film, video and TV: the typical application was to save a take that had been spoiled by ambient sound intrusion. This was contrasted with lab systems: large computer-based systems intended for off-line batch processing rather than real-time use.

The use of declickers was demonstrated. The earliest algorithms in this field were originally developed for 78rpm archives, but have been developed much further and are now extremely helpful in removing GSM noise, the familiar buzzing/pulsing interference caused by mobile phones. GSM noise can be shown to comprise buzz at around 217Hz and a series of pulses. The declicker can remove the impulsive noises, and the buzz can be removed with a dedicated Debuzz algorithm. The results of this were demonstrated with a 999 call recording, originally almost completely inaudible, but which when processed revealed much more information and the presence of a second, previously-unheard speaker in the background – of crucial important to the court case in which the recording was presented as evidence!

Gordon next discussed the use of adaptive filters. If the statistics of the noise are relatively constant, it’s possible to design a filter to separate speech (which tends to change rapidly) from the noise. Additional improvements can sometimes be achieved by treating low and mid frequencies differently to high frequencies, based on perceptual models of hearing and intelligibility.

Some of the interesting applications of adaptive filters include cleaning-up reverberant spaces such as holding cells and transfer vans, and removal of the 400Hz buzz from aircraft power systems that can degrade air traffic control recordings. And, in a reversal of the normal filtering, it was described how CEDAR removed the shouting from a cockpit voice recording in a helicopter that had just suffered a catastrophic mechanical failure, so the investigators could listen to the mechanical sounds to trace the cause of the accident.

Cross-channel adaptive filters can overcome steps taken to defeat surveillance, such as using loud radio or TV to mask a conversation. This type of filter exploits the correlation between the direct broadcast signal (if available) and the tonally altered broadcast signal present in the surveillance, and can effectively remove it from the surveillance signal. If there isn’t a convenient reference of the broadcast, use of multiple microphone locations causes some to have more speech and others to have more interfering signal, giving the cross-channel adaptive filter enough to work with. A reconstructed demonstration was played in which, when using a single mic recording of some speech in the presence of loud music from a radio, a transcription expert obtained approximately 30% accuracy. Adding a second mic positioned closer to the radio than the first and using this as the reference channel for the cross-channel adaptive filter, the intelligibility was hugely improved, and the transcription accuracy increased to 100%.

The form of broadband noise reduction known as spectral subtraction is an impressive tool in music production and restoration, but in forensics its use can be more limited: although it improves listenability and reduces fatigue, the best that can be hoped for regarding intelligibility is that it doesn’t damage it. Nonetheless, it has significant other uses in audio forensics, such as removing the hiss that can be added by adaptive filters. EQ, despite its simplicity and ubiquity, has been a staple processor for forensics since long before the days of DSP and adaptive filters. Removal of low frequencies and the addition of a little boost in the upper mids can hugely increase intelligibility. Limiters are used to reduce the impact of sudden loud noises. By its nature, forensic audio can involve extreme dynamic ranges. When a surveillance officer or transcriptor is listening closely to very low-level signals at very high gain, loud sounds such as gunshots/vehicle crashes/etc. can, without limiting, damage the listener’s hearing. In other cases, such as a recording of a telephone conversation made using a hand-held recorder, balancing the levels of the local and remote speaker can help render the evidence more intelligible and therefore more useful in court.

Gordon mentioned the increasingly widespread suspicion that audio data mining is being deployed by security agencies: that is, mass interception of all voice communications with automatic recognition of certain key words (e.g. bomb, jihad, etc.). Gordon’s view is that this is not currently technically practical, but that its use may increase within a decade or two. What is currently feasible, and is being used to an ever greater degree, is automatic speaker recognition: commercial solutions are developing fast, but their robustness to voice signals that have been altered by enhancement processing is an ongoing research field. Another significant recent development is the prevalence of low bit-rate, highly-compressed perceptual codecs, which can make both enhancement and automatic speaker recognition more problematic.

Gordon concluded his lecture with a mention of spectrographic editing, which was invented by CEDAR. Time-domain editing can be recognised in a spectrograph, making this kind of evidence-tampering obvious. But spectrographic editing allows powerful manipulation of the signal, often invisible to future investigation. This tampering can be very dangerous in the wrong hands, but when used ethically can reduce or remove masking signals, making it a powerful enhancement tool.

Many thanks to Gordon for an eye-opening lecture, and his fascinating insights into the remarkable technology his company has created.

Lecture report by Michael Page

‘Remastering and Audio Restoration at Abbey Road Studios’

Title: ‘Remastering and Audio Restoration at Abbey Road Studios’
Location: Royal Academy of Engineering, London
Description: Lecture by Simon Gibson of Abbey Road Studios
Start Time: 18:30 for 19:00
Date: Tuesday 11th May 2010

EMI has an archive going back to 1898 and, since Abbey Road Studios opened in 1931, there has been a gradual increase in the remastering of that back catalogue for new formats. Starting with a potted history of EMI, the early years of recording and the work of Alan Blumlein, we move on to the emergence of remastering at Abbey Road and the systems and techniques used today. The talk will then concentrate on the use made of CEDAR Audio’s Retouch software in the audio restoration of The Beatles album remasters as well as its more usual use in the creation of music for the video game The Beatles Rockband. Along the way we will hear rare audio extracts from EMI’s archive and clips from The Beatles’ recordings to demonstrate these remastering and restoration techniques.

Technical Visit: National Film and Television School

Title: Technical Visit: National Film and Television School
Location: NFTS, Beaconsfield
Description: More details to follow
Start Time: 18:30
Date: Thursday 19th November 2009

The National Film and Television School in Beaconsfield (www.nftsfilm-tv.ac.uk) is the only UK film and television school based in its own studios, allowing students to make real films and television programmes in industry-standard facilities. Prominent members of the British film industry who teach at the school include Stephen Frears (The Queen), Brian Tufano (Trainspotting), David G Croft (Shooting Stars, Baddiel’s Syndrome), Lynda Myles (The Commitments) and Nik Powell (Ladies in Lavender, The Crying Game, Little Voice).

Originally called The National Film School, it opened in 1971 following four years of planning to create an institution to train new talent for the British film industry. With a loan from Rank, NFS bought the old Beaconsfield Film Studios in Buckinghamshire – which had previously been home to British Lion Film Corporation and The Crown Film Unit – and refitted it to professional film making standards.

During our evening visit, which a maximum of 30 members can attend, a brief introductory presentation in the school’s digital cinema will be followed by a tour of the studios and sound post-production areas. A film shoot will be in progress in the main studio and the television studio will have a daily magazine show in production. Access should be available to the school’s three new Euphonix dubbing theatres, even though they are in the middle of graduation productions, and to the technical sound hub.

After the tour the group will return to the cinema where Andrew Boulton, Head of Sound Production, will explain in more detail the educational and creative mission of the school from the sound perspective, with a few clips to demonstrate the range of work produced by students, before inviting questions from the audience.

Note that all AES UK technical visits are only open to AES members.

‘Improved methods for controlling touring loudspeaker arrays’

Title: ‘Improved methods for controlling touring loudspeaker arrays’
Location: Royal Academy of Engineering, London
Description: Lecture by Ambrose Thompson, Martin Audio
Start Time: 18:30 for 19:00
Date: Tuesday 9th March 2010

Download the recording of this lecture here (12MB MP3)

The focus of this paper is a popular type of line array loudspeaker used for large- and medium-scale sound reinforcement. These systems are required to deliver very high SPL to a large audience area sometimes as far as 100m from the array, but typically in the 30-70m range. This class of line array is characterised by relatively widely spaced acoustic sources, each with high vertical directionality compared to the more traditional steered column loudspeaker where the acoustic sources are small and tightly spaced. These differences, together with the fact that large audience regions are typically in the near-field, preclude the use of the existing techniques to control linear arrays.

Currently successful methods of control were examined and found to be inadequate for meeting a new more stringent set of user requirements. This paper describes how users of the modern articulated line array loudspeakers used for high level sound reinforcement can control these systems with more precision, and explains how these requirements can be formed into a mathematical model of the system suitable for numerical optimisation. The primary design variable for optimisation was the complex transfer functions applied to each acoustic source. How the optimised transfer functions were implemented with IIR/FIR filters on typically available hardware is explained, and a comparison made between the predicted and measured output for a large array.

Special Lecture: An interview with Neville Thiele

Title: An interview with Neville Thiele
Location: Royal Academy of Engineering, London
Description: Special Lecture: Interview conducted by Keith Howard
Start Time: 18:30 for 19:00
Date: 24th November 2009

Download audio recording of lecture (24MB MP3)

An excellent Tutorial by Neville Thiele can be found here (AES Members only, log-in required for www.aes.org)

Abstract

Neville Thiele’s name is known to anyone who has ever taken an interest in the practical design of moving coil loudspeakers, through the Thiele-Small parameters that bear his name and that of Richard Small. In 1961 he wrote a seminal paper on the design of vented (reflex) loudspeakers that – although it was largely ignored for 10 years until reproduced in the AES Journal – is now acknowledged as initiating the filter parameter based approach to loudspeaker analysis and synthesis which today is routinely used by the audio industry at large. In recognition of this, in 1994 he was awarded the AES Silver Medal.

In this interview-based lecture, Neville Thiele will talk about what led up to this breakthrough and its significance to the speaker design process. He will then give three short presentations on loudspeaker-related topics: filter-assisted bass alignments and novel crossover approaches; driver ageing effects; and driver impedance correction in crossover networks. Questions will then be invited from the audience.

‘The anatomy of a modern audio-video amplifier’

Title: ‘The anatomy of a modern audio-video amplifier’
Location: Royal Academy of Engineering, London
Description: Lecture by John Dawson, Arcam
Start Time: 18:30 for 19:00
Date: Tuesday 10th November 2009

Download audio recording of lecture (14MB MP3)

A modern Audio-Video amplifier/receiver (AVR) is an exceedingly complex piece of consumer electronics, requiring expertise in many aspects of analogue and digital audio and high definition video, plus considerable software skills. As such it represents a huge project for any small to medium sized audio company. This lecture takes a look inside the Arcam AVR600 – one of the few such units developed outside of the large Japanese CE companies – and will discuss some of the design choices made in order to try to ensure a good chance of commercial success.

‘From hi-fi to PA: predicting and measuring what we hear’

Title: ‘From hi-fi to PA: predicting and measuring what we hear’
Location: Royal Academy of Engineering, London
Description: Lecture by Peter Mapp, Mapp Associates
Start Time: 18:30 for 19:00
Date: Tuesday 20th October 2009

Download MP3 recording of this lecture (24MB)

Everyone wants ‘high quality’ sound – but what does this mean – is sound quality measureable? is it predictable? The talk will look at how we can assess sound quality – both in large spaces such as concert halls, cathedrals and even railway stations as well as in small rooms such as home theatres and hi-fi listening spaces. After introducing a number of parameters and concepts that affect sound quality and the listening experience, Peter will discuss how these can be measured and potentially predicted. In particular the use of 3D computer modeling of rooms will be highlighted together with the importance of bass frequency reproduction. A number of case studies and examples of problem sound systems/rooms will be presented. Peter will conclude the talk with an insight into some of his latest research and the introduction of a new measurement/assessment concept, SQI – the Sound Quality Index.

Technical Visit to Pinewood Studios

Title: Technical Visit to Pinewood Studios
Location: Pinewood Studios, Iver, Buckinghamshire
Start Time: 16:00
Date: 17 June 2009

‘How to make a high-resolution record label’

Title: ‘How to make a high-resolution record label’
Location: Royal Academy of Engineering
Description: Lecture by Philip Hobbs of Linn Records
Start Time: 18:30 for 19:00
Date: 9 June 2009

Philip Hobbs

The New HiFi

High Resolution Music and the next generation of enthusiasts

9th June 2009. Held at RAE, London.

Philip Hobbs

Phillips Hobbs is a Producer and Audio Consultant at Linn Records Ltd. He first worked for Linn in 1982, leaving to study on the Tonnmeister course, returning to Linn in 1987 after graduating. Philip’s main roles at Linn have been in music recording and speaker design. Philip described himself as being the ‘worst sort of communicator’, because, according to him, he is both ‘Scottish and an engineer’.

Philip talked tonight of how Linn’s business has been ‘transformed over the last 3 years’ by the introduction of their music download service, a service where the customer can choose the download quality all the way to 192kHz, 24bit and where all the downloads DRM-free.

Linn History

Phillip gives a ‘two minute trip down memory lane’ of how Linn started

Linn was founded by Ivor Tiefenbrun as an offshoot of Castle Precision Engineering, a machining company who made parts for such things as aircraft and Rolls Royce. The original home of Linn, Linn Business Park gave Linn it’s name, and Linn established it’s HiFi  pedigree with the Linn Sondek LP12, a record player still in production today.

Linn expanded it’s product range and has made amplifiers, CD players, active loudspeakers and Digital Stream Players which stream files from hard disk.

Linn The Record Label

Like many other hardware manufacturers, Linn developed an interest in the recording industry. The initial motivation was to make recordings to test the reproduction capability of the LP12 and to investigate vinyl cutting lathes to the same end, but Linn has subsequently blossomed into a serious audiophile record label with many original recordings.

Traditionally focused on classical music, Carol Kydd was the first Linn ‘proper jazz artist’ and Linn released her first album in 1983. Linn made an initial pressing of 7000 records, selling them through record shops. In 1984, Linn hooked up with a  band called Blue Nile, releasing their first album ‘A Walk Across the Rooftops’. Blue Nile were keen to sell lots of records and Linn ’spent hundreds of thousands trying to get them to release their second album’. Philip, who was designing speakers for Linn at this time, estimates the total bill in relation to Blue Nile to be just under £1 million. By 1992, Linn were working on building their classical catalogue in the ’standard boutique label’ philosophy by focusing on recording quality.

By 2006, Linn had around 250 titles and had established distribution in Japan and America. In Philip’s words, the business ‘was a complete catastrophe’, as it was ‘not economically viable to sell CDs in commercial retail space’, a trend, according to Philip, that had been developing since the 1990s. Philip recalls that the situation had become so bad by 2006, Linn were faced with a decision either to leave the record business altogether, or to find some radical new approach – to find ‘a way to get back to the customers’ avoiding ‘the frustration that traditional retailing gives to many companies, and record companies in particular’, namely that ‘the company is so far away from the people they’re selling to’.

Download Revolution

The conclusion at Linn was that they needed to use the internet ‘to connect directly to the customers without compromising on quality’. At this time, Apple’s I-Tunes service was well established and, thanks to the widening availability of fast broad-band services, the ‘possibility you could sell someone 1GB of data’ was becoming a reality.

So Linn built a web site where customers could download music directly – similar to the I-Tunes idea, but with a unique selling point – the ability to provide downloads up to 24-bit 192khz sampling rate (and loss-less), where the customer is free to choose the download resolution/sampling frequency from studio-master down to MP3-level quality.  Like I-Tunes, customers can buy individual tracks or whole albums with the higher quality downloads commanding a higher price tag.

According to Philip, despite an initial cost of £100k, the site is now profitable after around 2.5 years of service.

Philip stated the more traditional music distribution method of physical  media in shop-retail results in around a 15-20% share of the ticket price being returned to the record label. For example, if a CD retails for £15, £2.5 for the record company would be considered as ‘doing pretty well’. With the download service which operates in the absence of ‘middlemen’, the margins increase considerably. Philip estimates that the download business returns around 80% of profit, and further,  their  profits would probably increase were they to move entirely away from physical media, (which Linn continue to support out of loyalty to a minority of, presumably similarly loyal,  customers).

High Quality Master Recordings

A key factor that made Linn’s Hi-Res download business viable was their long-term focus on making recordings of the best quality possible, a commitment which had led them to make many of their master recordings at 96kHz or 192kHz. This resulted in a ready supply of high-resolution back catalogue. This situation was, according to Philip, in contrast to many other record companies whose masters were typically made/archived at 44.1khz or 48kHz.

Customer Focus

Linn have a base of around 120,000 customers. With their focussed direct-marketing approach, a Friday evening email-newsletter often results in many £1000s or business by Monday from their download service.

Philip also points out that these direct marketing activities rarely offer significant discounts (which would reduce profits)  – usually, they are simply aimed to draw the customers attention to some new material or other works that may be similar to previously purchased material.

Download Usage – how are the customers using the downloads?

Philip sees 4 main customer types split by playback method.

  • PCs with sound cards, Windows Media Player, I-Tunes etc.
  • Portable Devices (I-Pods, Zune etc.)
  • Burn-to-Disc – customers making CDR/DVD-R copies
  • Streamed Media Players – from Linn and others – files streamed from local server

The DRM Issue

Linn considered the possibility of using DRM to protect their downloaded material, but at the time the web site was being prepared, it became clear to Linn that DRM just didn’t work sufficiently well. According to Philip, many people felt that moral arguments eventually killed DRM but wonders whether it was in large part due to an inability to make smoothly working system (and without imposing excessively limiting restrictions on the customers).

Download Formats

Offering such a wide range of download quality options has provided Linn with some interesting statistics on the decisions customers make when offered a quality/cost choice.  Despite price differentials, In 2007, 25% of purchases were of the ’studio master’ quality. By 2008, the figure had rising to around 50%, and so far in 2009, this seems to have risen further to around 70%. Of the CD quality albums downloaded, customers are showing a 50/50 split between choosing FLAC and WMA.

Additionally, of those customers who purchased studio master quality downloads, where they were offered a choice between 96kHz or 192kHz,   80% chose  the higher rate in spite of the fact that many players can’t play 192kHz!

Further, Phil is convinced that around half the customers who have purchased  studio-master quality downloads don’t currently have the playback equipment to support the sample rate/bit-depth they bought. His conclusion is that given a choice, Linn’s customers prefer to buy the best quality available. If this seems odd, there may be some logic here, and in some way maintaining the Linn tradition. The original Linn LP12 can be upgraded all the way to its current production specification. This ability to upgrade has been a Linn philosophy, at least for the LP12, for many years. By upgrading the equipment, the customer can benefit without buying into a whole new format. If the customer buys the Studio Master, the data they get is all that was recorded – it is essentially ‘as good as it will ever be’ – and with such a music collection, future equipment upgrades may offer further sound improvements when replaying the original material, in many ways similar to vinyl.

High Resolution Benefits

Philip made an impactual demonstration of the potential enjoyment offered by high resolution and high quality recording by playing Handel’s Messiah conducted by John Butt (and where Philip was himself the recording engineer). Unbeknown to the audience, the recording began at rate of 88.2kHz/ 24-bit, but as playback progressed, the bit rate dropped to 44.1khz 16 bit, then to 192kb mp3, then to 96kb, mp3. Although these differences were not immediately obvious to all, (at least in the listening environment in which they were presented), Philip described how it was common for the listeners attention to progressively drift to other matters as the bit-rate dropped, they ‘tend to get bored and start thinking about something else’. This certainly described my personal experience with surprising accuracy.

Streaming Player

Philip briefly demonstrated one of the Linn Streaming Players which offer one possible method of replaying the downloaded material. One of the benefits Philip sees for customers with this type of equipment is a significant increase in convenience. Gone are the walls of CD/LP shelves, replaced by a compact hard-disk-based server and controlled via a little application on their I-Phone, a use case Philip describes as ‘addictive’.

The Future

Linn are beginning to diversify. They have taken on a couple of small labels and are offering downloads for them alongside their own material. For those interested in purchasing downloads, Linn’s web site may be found at http://www.linnrecords.com/ and test files for evaluating quality (and compatibility)  can be found at http://www.linnrecords.com/linn-downloads-testfiles.aspx

The AES would like to thank Phillip for his fascinating talk. I’m sure many members were greatly encouraged to hear that there are still many customers for whom recording quality something worth paying for.

Report by Nathan Bentall

Edited by Keith Howard

‘Critical listening/evaluation – a path to the future of quality music’

Title: ‘Critical listening/evaluation – a path to the future of quality music’
Location: Royal Academy of Engineering, London
Description: George Massenburg of George Massenburg Labs
Start Time: 18:30 for 19:00
Date: 3rd June 2009

George Massenburg needs little introduction – even if you don’t know of him, you have probably heard his recordings. For a detailed biography, see www.massenburg.com/cgi-bin/ml/bio.html.

Meeting Report

Quality recordings

What is difficult to represent in this report is the passion George exudes about music, a passion which drives him to strive (and help others to strive) to continually improve the quality of recorded music. Many recordings were replayed in the course of this lecture, some made by George, others not. Most were 192kHz, 24-bit; some were transferred from analogue master tapes.

George began by replaying a Diana Krall track, pointing out the subtlety and detail captured by Al Schmitt. In a change of style, the next track was by Neil Young – a new song about the recent financial crisis with the chorus line “A bailout is coming, but not for you”. Elements of the recording were described, there being a pair of guitars (slide and acoustic), rock’n'roll drums and a hi-hat “somewhere in the background”.

George then played a clip from YouTube of a recent and currently very popular track by Autotune The News (their second track, pirates. drugs. gay marriage), an original piece where television newsreaders have been cleverly edited in time and pitch such that they appear to be singing. The point here? Although the YouTube clip has been extremely popular (it received 1.5 million hits in the first week, possibly setting a web record), and although George admitted to thinking it “brilliant”, the audio quality is very poor. George pointed out that repeated listening at this YouTube-quality quickly gets very annoying because of the low-fidelity sound.

Compression Artefacts

George then played the results of some subtraction tests on lossy audio codecs, a technique which George refers to as the Moorer test as it was originally suggested by James A Moorer. In these tests, high-quality 192kHz, 24-bit recordings were converted to various encoded forms such as MP3 and AAC. The encoded files were then decoded and upsampled back to the original 192kHz, 24-bit. A sample-by-sample subtraction was then performed, and the resultant difference – the error introduced by the codec – then replayed. The resulting error signal is surprisingly high in amplitude (estimated by George as typically 25-30% peak), clearly correlated to the signal and with a complex relationship to the original sound (not simple harmonic distortion).

George takes the view that his students should learn to recognise the nature of the codec error using this subtraction method and then listen to the encoded music. Using this learning technique, listeners can familiarise themselves with the artefacts’ sound in isolation, and can subsequently pick them out more readily when the encoded material is played.

Listen Again

George believes that every time we hear a piece of music we should have the possibility of hearing something new – “to take home something else” – and that this is more readily achieved with high resolution recordings. Although George concedes that it’s possible to make a “pretty good” 44.1kHz/16-bit CDs, he remembers the first time he heard a digital recording: rather than being impressed, he was “horrified”.

Subsequent work to push the boundaries of converter technology (George recalls the contribution of Paul Frindle in this area) has convinced him that good digital now is good. He believes that we don’t have to go back to magnetic tape to make good records, and describes himself as having “an easy peace” with both vinyl and analogue tape.

Recording Tips

George deprecates recording techniques in which small elements are recorded separately and later combined/corrected/stretched/re-tuned, etc. He believes a key to great music recording is to maintain a performance focus. Preferably, the band should perform and be recorded playing simultaneously in the same space. George offers these suggestions to help your next recording:

  • 1) Only use destructive record.
  • 2) No punch-ins.
  • 3) No one is allowed to take the recording home and ‘tweak’ it – they can do another take, but the previous one will be overwritten.

The AES UK section and George wish to thank the companies who kindly supplied equipment for this lecture, namely ATC (monitor loudspeakers), Digidesign (ProTools system), Arcam (DVD player) and Prism Sound (D/A converters).

Report by Nathan Bentall (edited by Keith Howard)