Using RemoteIO audio unit

I’ve had a nasty old time trying to get some audio stuff going on the iPhone, no thanks to Apple’s lack of documentation. If you’re an iPhone developer interested in getting RemoteIO/IO Remote/whatever it’s called working on the iPhone… do I have good news for you. Read on.

Wanna skip the Core Audio learning curve and start writing code straight away? Check out my new project:

The Amazing Audio Engine: Core Audio, Cordially

Update: Thanks to Joel Reymont, we now have an explanation for the “CrashIfClientProvidedBogusAudioBufferList” iPhone simulator bug: The simulator doesn’t like mono audio. Thanks, Joel!

Update: Happily, Apple have now created some excellent documentation on Remote IO, with some good sample projects. I recommend using that as a resource now that it’s there, as it will continue to be updated.

Update: Tom Zicarelli has created a very extensive sample app that demonstrates the use of AUGraph, with all sorts of goodies.

So: we need to obtain an instance of the RemoteIO audio unit, configure it, and hook it up to a recording callback, which notifies you that data is ready to be grabbed, and from which you pull the data out of the audio unit.


Overview

  1. Identify the audio component (kAudioUnitType_Output / kAudioUnitSubType_RemoteIO / kAudioUnitManufacturer_Apple)
  2. Use AudioComponentFindNext(NULL, &descriptionOfAudioComponent) to obtain the AudioComponent, which is like the factory with which you obtain the audio unit
  3. Use AudioComponentInstanceNew(ourComponent, &audioUnit) to make an instance of the audio unit
  4. Enable IO for recording and possibly playback with AudioUnitSetProperty
  5. Describe the audio format in an AudioStreamBasicDescription structure, and apply the format using AudioUnitSetProperty
  6. Provide a callback for recording, and possibly playback, again using AudioUnitSetProperty
  7. Allocate some buffers
  8. Initialise the audio unit
  9. Start the audio unit
  10. Rejoice

Here’s my code: I’m using both recording and playback. Use what applies to you!
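One note before the listing: it calls a `checkStatus` helper that isn’t defined anywhere in the post. A minimal stand-in (my assumption, not the author’s actual implementation) that just reports any non-zero OSStatus might look like this:

```c
#include <stdio.h>

// Hypothetical stand-in for the post's undefined checkStatus helper.
// OSStatus is a signed 32-bit integer; zero (noErr) means success,
// so report anything else and carry on.
static int checkStatus(int status) {
    if (status != 0) {
        fprintf(stderr, "Core Audio call failed with status %d\n", status);
    }
    return status;
}
```

In real code you might prefer to log the four-character error code, or assert in debug builds.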

Initialisation

Initialisation looks like this. We have a member variable of type AudioComponentInstance which will contain our audio unit.

The audio format described below uses SInt16 for samples (i.e. signed, 16 bits per sample).
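For packed, interleaved linear PCM the AudioStreamBasicDescription fields are interdependent: mBytesPerFrame must equal mChannelsPerFrame × mBitsPerChannel / 8, and with mFramesPerPacket = 1, mBytesPerPacket equals mBytesPerFrame. A plain-C sanity check of that arithmetic (ordinary ints standing in for the struct fields):

```c
// Bytes per frame for packed, interleaved linear PCM: each frame holds one
// sample per channel, and each sample occupies bitsPerChannel / 8 bytes.
int pcmBytesPerFrame(int channelsPerFrame, int bitsPerChannel) {
    return channelsPerFrame * (bitsPerChannel / 8);
}

// For uncompressed PCM there is one frame per packet, so a packet
// is just framesPerPacket frames' worth of bytes.
int pcmBytesPerPacket(int bytesPerFrame, int framesPerPacket) {
    return bytesPerFrame * framesPerPacket;
}
```

For the mono 16-bit format used below this gives 2 bytes per frame and 2 bytes per packet, matching the values assigned to mBytesPerFrame and mBytesPerPacket.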

#define kOutputBus 0
#define kInputBus 1
 
// ...
 
 
OSStatus status;
AudioComponentInstance audioUnit;
 
// Describe audio component
AudioComponentDescription desc;
desc.componentType = kAudioUnitType_Output;
desc.componentSubType = kAudioUnitSubType_RemoteIO;
desc.componentFlags = 0;
desc.componentFlagsMask = 0;
desc.componentManufacturer = kAudioUnitManufacturer_Apple;
 
// Get component
AudioComponent inputComponent = AudioComponentFindNext(NULL, &desc);
 
// Get audio units
status = AudioComponentInstanceNew(inputComponent, &audioUnit);
checkStatus(status);
 
// Enable IO for recording
UInt32 flag = 1;
status = AudioUnitSetProperty(audioUnit, 
                              kAudioOutputUnitProperty_EnableIO, 
                              kAudioUnitScope_Input, 
                              kInputBus,
                              &flag, 
                              sizeof(flag));
checkStatus(status);
 
// Enable IO for playback
status = AudioUnitSetProperty(audioUnit, 
                              kAudioOutputUnitProperty_EnableIO, 
                              kAudioUnitScope_Output, 
                              kOutputBus,
                              &flag, 
                              sizeof(flag));
checkStatus(status);
 
// Describe format
AudioStreamBasicDescription audioFormat;
audioFormat.mSampleRate       = 44100.0;
audioFormat.mFormatID         = kAudioFormatLinearPCM;
audioFormat.mFormatFlags      = kAudioFormatFlagIsSignedInteger | kAudioFormatFlagIsPacked;
audioFormat.mFramesPerPacket  = 1;
audioFormat.mChannelsPerFrame = 1;
audioFormat.mBitsPerChannel   = 16;
audioFormat.mBytesPerPacket   = 2;
audioFormat.mBytesPerFrame    = 2;
 
// Apply format
status = AudioUnitSetProperty(audioUnit, 
                              kAudioUnitProperty_StreamFormat, 
                              kAudioUnitScope_Output, 
                              kInputBus, 
                              &audioFormat, 
                              sizeof(audioFormat));
checkStatus(status);
status = AudioUnitSetProperty(audioUnit, 
                              kAudioUnitProperty_StreamFormat, 
                              kAudioUnitScope_Input, 
                              kOutputBus, 
                              &audioFormat, 
                              sizeof(audioFormat));
checkStatus(status);
 
 
// Set input callback
AURenderCallbackStruct callbackStruct;
callbackStruct.inputProc = recordingCallback;
callbackStruct.inputProcRefCon = self;
status = AudioUnitSetProperty(audioUnit, 
                              kAudioOutputUnitProperty_SetInputCallback, 
                              kAudioUnitScope_Global, 
                              kInputBus, 
                              &callbackStruct, 
                              sizeof(callbackStruct));
checkStatus(status);
 
// Set output callback
callbackStruct.inputProc = playbackCallback;
callbackStruct.inputProcRefCon = self;
status = AudioUnitSetProperty(audioUnit, 
                              kAudioUnitProperty_SetRenderCallback, 
                              kAudioUnitScope_Global, 
                              kOutputBus,
                              &callbackStruct, 
                              sizeof(callbackStruct));
checkStatus(status);
 
// Disable buffer allocation for the recorder (optional - do this if we want to pass in our own)
flag = 0;
status = AudioUnitSetProperty(audioUnit, 
                              kAudioUnitProperty_ShouldAllocateBuffer,
                              kAudioUnitScope_Output, 
                              kInputBus,
                              &flag, 
                              sizeof(flag));
checkStatus(status);
 
// TODO: Allocate our own buffers if we want
 
// Initialise
status = AudioUnitInitialize(audioUnit);
checkStatus(status);

Then, when you’re ready to start:

OSStatus status = AudioOutputUnitStart(audioUnit);
checkStatus(status);

And to stop:

OSStatus status = AudioOutputUnitStop(audioUnit);
checkStatus(status);

Then, when we’re finished:

AudioComponentInstanceDispose(audioUnit);

And now for our callbacks.

Recording

static OSStatus recordingCallback(void *inRefCon, 
                                  AudioUnitRenderActionFlags *ioActionFlags, 
                                  const AudioTimeStamp *inTimeStamp, 
                                  UInt32 inBusNumber, 
                                  UInt32 inNumberFrames, 
                                  AudioBufferList *ioData) {
 
    // Use inRefCon to access our interface object (we passed 'self' as
    // inputProcRefCon during setup). Then, use inNumberFrames to figure out
    // how much data is available, and make that much space available in
    // buffers in an AudioBufferList.

    AudioBufferList *bufferList; // <- Fill this up with buffers (you will want to malloc it, as it's a dynamic-length list)
 
    // Then:
    // Obtain recorded samples
 
    OSStatus status;
 
    // 'audioInterface' is our interface object, recovered from inRefCon
    status = AudioUnitRender([audioInterface audioUnit], 
                             ioActionFlags, 
                             inTimeStamp, 
                             inBusNumber, 
                             inNumberFrames, 
                             bufferList);
    checkStatus(status);
 
    // Now, we have the samples we just read sitting in buffers in bufferList
    DoStuffWithTheRecordedAudio(bufferList);
    return noErr;
}
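Filling in that bufferList is left as a TODO above; the allocation arithmetic might look like the sketch below. The struct definitions are simplified stand-ins for the real CoreAudio types (which come from the AudioToolbox framework), so treat this as an illustration of the sizing only:

```c
#include <stdlib.h>

// Simplified stand-ins for the CoreAudio structs, for illustration only;
// a real project includes <AudioToolbox/AudioToolbox.h> and uses the
// framework's own AudioBuffer / AudioBufferList definitions.
typedef struct {
    unsigned int mNumberChannels;
    unsigned int mDataByteSize;
    void        *mData;
} AudioBuffer;

typedef struct {
    unsigned int mNumberBuffers;
    AudioBuffer  mBuffers[1]; // variable-length array in the real type
} AudioBufferList;

// Allocate a list with one interleaved buffer big enough for inNumberFrames
// frames of the 16-bit mono format configured earlier (2 bytes per frame).
// For N non-interleaved buffers you'd malloc
// sizeof(AudioBufferList) + (N - 1) * sizeof(AudioBuffer) instead.
AudioBufferList *allocateBufferList(unsigned int inNumberFrames) {
    AudioBufferList *list = malloc(sizeof(AudioBufferList));
    list->mNumberBuffers = 1;
    list->mBuffers[0].mNumberChannels = 1;
    list->mBuffers[0].mDataByteSize   = inNumberFrames * 2;
    list->mBuffers[0].mData           = malloc(list->mBuffers[0].mDataByteSize);
    return list;
}
```

Remember to free both the sample buffer and the list when tearing down.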

Playback

1
2
3
4
5
6
7
8
9
10
11
static OSStatus playbackCallback(void *inRefCon, 
                                  AudioUnitRenderActionFlags *ioActionFlags, 
                                  const AudioTimeStamp *inTimeStamp, 
                                  UInt32 inBusNumber, 
                                  UInt32 inNumberFrames, 
                                  AudioBufferList *ioData) {    
    // Notes: ioData contains buffers (may be more than one!)
    // Fill them up as much as you can. Remember to set the size value in each buffer to match how
    // much data is in the buffer.
    return noErr;
}

Finally, rejoice with me in this discovery ;)

Resources that helped

No thanks at all to Apple for their lack of accessible documentation on this topic – They really have a long way to go here! Also boo to them with their lack of search engine, and refusal to open up their docs to Google. It’s a jungle out there!

Update: You can adjust the latency of RemoteIO (and, in fact, any other audio framework) by setting the kAudioSessionProperty_PreferredHardwareIOBufferDuration property:

float aBufferLength = 0.005; // In seconds
AudioSessionSetProperty(kAudioSessionProperty_PreferredHardwareIOBufferDuration, 
                        sizeof(aBufferLength), &aBufferLength);

This adjusts the length of buffers that’re passed to you – if buffer length was originally, say, 1024 samples, then halving the number of samples halves the amount of time taken to process them.
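The implied frames-per-callback is just duration × sample rate, though the hardware rounds your preferred duration to something it actually supports (often a power-of-two buffer size), so treat the result as approximate:

```c
// Frames per callback implied by a preferred buffer duration. The hardware
// rounds the request, so the real inNumberFrames may differ (e.g. a request
// for ~220 frames may actually deliver 256).
int framesForDuration(double seconds, double sampleRate) {
    return (int)(seconds * sampleRate);
}
```

For example, 0.005 s at 44100 Hz requests roughly 220 frames, and 20 ms at 8000 Hz corresponds to 160 frames.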

Another Update: In the comments below, Florian Bomers pointed out that I was using AudioUnitUninitialize to clean up the audio unit. This is incorrect; it should in fact be AudioComponentInstanceDispose. Further discussion here. Cheers Florian!


62 Comments

  1. Rarejai
    Posted December 7, 2011 at 6:28 pm | Permalink

    Thanks for the great example. I am getting stuck for the audio playback…my audio format is like this:

    AudioStreamBasicDescription streamFormat;
    streamFormat.mSampleRate = 8000;
    streamFormat.mFormatID = kAudioFormatULaw;
    streamFormat.mFormatFlags = 0;
    streamFormat.mFramesPerPacket = 1;
    streamFormat.mBytesPerFrame = 2;
    streamFormat.mBytesPerPacket = streamFormat.mBytesPerFrame;
    streamFormat.mChannelsPerFrame = 1;
    streamFormat.mBitsPerChannel = 16;

    I get a constant noise even I just return no error on the playback callback function. Please help. Thanks!

    • Posted December 8, 2011 at 12:12 pm | Permalink

      Hey Rarejai – The ULaw format is for storage only, for use with things like the Audio File Services. Remote IO only works with PCM.

      • Rarejai
        Posted December 8, 2011 at 2:21 pm | Permalink

        Hi Michael – actually I have a hardware device which streaming out u-law audio data. I need to play the audio on iPad. The data size is 1200 bytes per package. I have pass the data (NSData *) to the playback callback function. Here is my callback:

        NSAutoreleasePool *pool = [[NSAutoreleasePool alloc] init];

        ShowView *viewController = (ShowView *)inRefCon;

        NSData *data = [[NSData alloc] initWithData:viewController->audioData];

        if ([data length] > 0) {
            const int channel = 0;
            Float32 *buffer = (Float32 *)ioData->mBuffers[channel].mData;

            // Generate the samples
            for (UInt32 frame = 0; frame < inNumberFrames; frame++)
            {
                NSData *bytedata = [NSData dataWithBytes:[data bytes] + (frame * 4) length:4];
                unsigned char *fileBytes = (unsigned char *)malloc([bytedata length]);
                [bytedata getBytes:fileBytes];
                buffer[frame] = *(Float32 *)(fileBytes);
                free(fileBytes);
            }

            viewController->audioData = nil;
        }

        [data release];

        [pool release];

        return noErr;

        The audio is playing good. Just there are some constant noise from the background. Any suggestion is appreciated. Thanks!

        • Posted December 9, 2011 at 2:47 pm | Permalink

          Honestly I’m surprised it works at all – I suggest you use an audio converter, and use PCM audio for the audio unit.

          In terms of your implementation, firstly, I don’t see any synchronisation going on. That code will run on a separate thread, which means you’ll get all sorts of problems unless you’re careful with how you access the data. You should use a ring buffer or something. If you’re not familiar with multithreading principles, I suggest grabbing a good book or finding a tutorial online, because it’s very, very important to get right. Note, though, that you should avoid holding locks on the high-priority Core Audio thread. A lock-free ring buffer is the best bet. I’ve built one, or you could grab Kurt Revis’ VirtualRingBuffer, which is very, very clever, and I think better than mine.

          Also, you’re doing an awful lot of memory allocation there – you should really avoid any kind of allocation in Core Audio’s realtime thread.

          If you need further help, I suggest posting on the Core Audio API mailing list.

  2. ivan
    Posted December 22, 2011 at 9:12 am | Permalink

    actually I wanted to play amr file .So I use decoder .It worked,but very noisy and abnormal.

    static OSStatus playbackCallback(void *inRefCon, AudioUnitRenderActionFlags *ioActionFlags, const AudioTimeStamp *inTimeStamp, UInt32 inBusNumber, UInt32 inNumberFrames, AudioBufferList *ioData) {
        // loop through all the buffers that need to be filled
        for (int i = 0; i < ioData->mNumberBuffers; i++) {
            AudioBuffer buffer = ioData->mBuffers[i];
            UInt32 *frameBuffer = buffer.mData;
            AmrDecode(frameBuffer);
        }
        return noErr;
    }

    I really need some help

  3. ivan
    Posted December 22, 2011 at 9:14 am | Permalink

    Hi Michael, I tried to play 8000-rate ,and kinda fail.

    audioFormat.mSampleRate         = 8000.0;
    audioFormat.mFormatID           = kAudioFormatLinearPCM;
    audioFormat.mFormatFlags        = kAudioFormatFlagIsSignedInteger | kAudioFormatFlagIsPacked;
    audioFormat.mFramesPerPacket    = 1;
    audioFormat.mChannelsPerFrame   = 1;
    audioFormat.mBitsPerChannel     = 16;
    audioFormat.mBytesPerPacket     = 2;
    audioFormat.mBytesPerFrame      = 2;
    audioFormat.mReserved           = 0;
    

    actually I wanted to play amr file .So I use decoder .It worked,but very noisy and abnormal.

    static OSStatus playbackCallback(void *inRefCon, AudioUnitRenderActionFlags *ioActionFlags, const AudioTimeStamp *inTimeStamp, UInt32 inBusNumber, UInt32 inNumberFrames, AudioBufferList *ioData) {
        // loop through all the buffers that need to be filled
        for (int i = 0; i < ioData->mNumberBuffers; i++) {
            AudioBuffer buffer = ioData->mBuffers[i];
            UInt32 *frameBuffer = buffer.mData;
            AmrDecode(frameBuffer);
        }
        return noErr;
    }

    I really need some help

  4. Michael
    Posted January 1, 2012 at 3:50 pm | Permalink

    Hi Michael.

    after i have put together all this code , how would i get the actual audio data ? where exactly does it saved ?

    and when all this code is done ? i have to put it all into 1 method and then call it ? or should i only call -start method ?

    what operation should be taken to get the real time data ?

    i have spent days to understand it but i couldnt . how exactly i

    • Posted January 8, 2012 at 12:02 pm | Permalink

      Hi, Michael – it sounds like another tutorial might be in order. Please stay tuned, I’ll put one together over the next week or two and post it on the blog.

      • Arijit
        Posted August 27, 2012 at 7:39 am | Permalink

        Can you put a code

  5. Peter Kramer
    Posted March 12, 2012 at 11:35 pm | Permalink

    Jake asked back on November 13, 2010 if there was a way to get

    “…input from iPhone mic and play back on bluetooth speaker…”

    Is there? If not, why not?

    • Posted March 13, 2012 at 10:07 am | Permalink

      Hi Peter,

      I’m not certain – I haven’t played with bluetooth much. I do know that iOS’s audio routing capabilities are pretty limited, so it could go either way. My suggestion is to go check out the audio session documentation, and see what’s there. If it lets you connect a bluetooth speaker independently of the input system, then you should be good to go.

  6. Posted March 21, 2012 at 1:31 pm | Permalink

    I am still looking for a way to play sounds based on the notes of the pentagram. Can you help?

  7. StefanS
    Posted April 6, 2012 at 9:44 am | Permalink

    Hello Michael,

    This tutorial has helped me a lot. Thank you for that. My question: I am currently working on a VoIP application and I want to use SPEEX as a speech coder. This coder specifically asks for an audio buffer of 20 ms, 160 samples and a sample rate of 8000Hz. However, I don’t think it is possible to set the buffer length to exactly 20 ms, or am I missing something? And if I set the sample rate to 8000 for the Remote IO unit I get inNumberFrames = 93 or 92. If you do the math, for a buffer of 20 ms and sample rate of 8000, I should get exactly 160 samples. Important note: I am still working in the simulator. Another thing: do you think that maybe Audio Queues would be a better solution for a VoIP application?

    Thanks a lot, Stefan

    • Posted April 6, 2012 at 10:40 am | Permalink

      Hi Stefan,

      Core Audio’s never that exact – it tries to find the closest parameters to what you request, but it’ll never be exact. If you need exactly 160 samples at a time for the SPEEX conversion, then use a circular buffer to store the audio and process it in chunks of the required size.

      No, Audio Queues isn’t as low-latency as Remote IO. You definitely want Remote IO for a VoIP app.
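The chunking approach Michael describes can be sketched as a simple circular buffer: accumulate whatever Core Audio delivers (93 frames, 92 frames, …) and consume it only in fixed 160-sample blocks. This is a single-threaded illustration only; on the realtime audio thread you’d want a lock-free ring buffer like the one Michael mentions elsewhere in these comments:

```c
#include <stdint.h>

#define RING_CAPACITY 4096  // power of two, so the modulo wraps cleanly
#define CHUNK_FRAMES  160   // 20 ms at 8000 Hz

typedef struct {
    int16_t  samples[RING_CAPACITY];
    unsigned head; // total frames written (monotonic)
    unsigned tail; // total frames read (monotonic)
} RingBuffer;

static unsigned ringCount(const RingBuffer *rb) {
    return rb->head - rb->tail; // unsigned wrap-around gives the fill level
}

// Called with however many frames the audio callback delivered.
void ringWrite(RingBuffer *rb, const int16_t *in, unsigned frames) {
    for (unsigned i = 0; i < frames; i++)
        rb->samples[rb->head++ % RING_CAPACITY] = in[i];
}

// Returns 1 and fills 'out' whenever a full 160-sample chunk is available,
// 0 otherwise (no timer needed: just drain chunks as data accumulates).
int ringReadChunk(RingBuffer *rb, int16_t *out) {
    if (ringCount(rb) < CHUNK_FRAMES) return 0;
    for (unsigned i = 0; i < CHUNK_FRAMES; i++)
        out[i] = rb->samples[rb->tail++ % RING_CAPACITY];
    return 1;
}
```

The encoder then loops on ringReadChunk, so it naturally runs as often as full chunks arrive, regardless of the callback’s frame count.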

  8. XXX
    Posted April 8, 2012 at 12:30 am | Permalink

    StefanS, I try AudioQueue for VoIP. 160 samples is work for 8000Hz. But callback calls every 1-2 ms, but not every 20ms interval. After 25 calls it paused for 512 ms and calls every 1-2 ms again in circle. Michael is right.

    • StefanS
      Posted April 8, 2012 at 9:42 am | Permalink

      Thank you both. I’ll work with audio units (not audio queues) and I’ll try to implement the circular buffer created by Michael. As my work proceeds, I may have some additional questions, I am fairly new to iOS development. :)

  9. XXX
    Posted April 11, 2012 at 10:43 am | Permalink

    Hello Michael,

    I try to use circular buffer to store audio data and process it in chunks of 20ms samples for VoIP app, but I can’t do it because I need accurate interval 20 ms for send packet with data to network and I don’t know how do it because NSTimer is not so accurate. Can you help with this question? Which timer is better for chunking audio data from your buffer with small interval?

    • Posted April 11, 2012 at 10:45 am | Permalink

      Why would you need to use a timer? Why not just process 20 samples at a time, as they become available in the buffer?

      • StefanS
        Posted April 11, 2012 at 11:20 am | Permalink

        My question is, If calling of the callback exactly every 20ms is not possible then it wouldn’t be ok to just put the coder into the callback and process the audio -> there’s the problem of synchronization. The codec should be called more frequently than the callback. Where should I put the codec and how to schedule it? Anyone? :)

        Actually, I am still having problems setting the hardware sample rate. I set it as 8000Hz, and when I initialize the audio session I get its value (it says 8000Hz so that’s ok) but then somehow my application changes this value to 44100Hz (I hear it). So, in my callbacks the inNumberFrames is 512 (for 44100) and if I try to set the Audio Unit’s sample rate to 8000 the inNumberFrames becomes 93 93 92 (the value is not constant). Does anyone have any idea how this happens? Could this be a Simulator related problem? For 8000Hz and a buffer duration of 20ms one should get exactly 160 samples.

        Thanks a lot, Stefan

        • Posted April 11, 2012 at 11:26 am | Permalink

          The simulator can behave very differently to the device. When working with audio, always have a device handy, because you’ll see dramatically different effects. You can use the simulator sometimes, but unless you’re doing most of your testing using the device, you’re just making life insanely hard for yourself.

          As for processing the buffer, just process it in blocks of 20 samples. I don’t really understand why there’s a synchronisation problem…Or why you’re limited to processing just one 20 sample block per callback. Just loop!

          Whether you do it on the realtime thread in the callback, or in an offline processing thread is up to you – it depends on whether the coder is suitable for use in a realtime context (ie. whether it holds locks, allocates memory, takes a long time, etc.).

      • XXX
        Posted April 11, 2012 at 12:31 pm | Permalink

        Is audio data passed to the buffer via callback with intervals of 20ms? This solves the problem. Or may be audio data comes to buffer with different intervals (20ms differs +/-5-10ms)?

  10. StefanS
    Posted April 17, 2012 at 9:54 am | Permalink

    Hello,

    Does anyone here know how the iLBC codec is used? Apparently, my Convertor does not accept when the mFormatID from the AudioStreamBasicDescription is set to kAudioFormatiLBC.

    Thanks, Stefan

  11. owen
    Posted May 10, 2012 at 4:03 pm | Permalink

    Im trying to use this code however i get undeclared identifiers for almost all data types. i have looked them up and they seem to be in the AudioUnit.framework, however that framework is added to my link binarys with libraries so i dont understand why the data types arent recognized for example at the very top (the first 2 lines) AudioComponentInstance audioUnit; AudioComponentDescription desc; both are “undeclared identifier”

    • owen
      Posted May 10, 2012 at 6:10 pm | Permalink

      answer: you not only need to include it in the link libraries page but also add

      import

  12. StefanS
    Posted May 11, 2012 at 8:37 am | Permalink

    Hello Michael,

    About this:

    // Disable buffer allocation for the recorder (optional – do this if we want to pass in our own)
    flag = 0;
    status = AudioUnitSetProperty(audioUnit, kAudioUnitProperty_ShouldAllocateBuffer, kAudioUnitScope_Output, kInputBus, &flag, sizeof(flag));

    What buffer does it refer to? I see no difference in the behavior of my application if I decide to disable it or not. I use temporary AudioBuffer and AudioBufferList to store the input data and then copy this data to the Circular buffer you have provided.

    Another question: About this Voice-Processing IO Audio Unit and its acoustic echo cancellation. Do I simply use it in my code and this nice echo cancellation effect “magically” appears, or should I do some special configuration beforehand? Does it work at all?

    Thank you, you’ve been such a help to me and my beginnings in iOS Audio development. Stefan

    • Posted May 11, 2012 at 10:12 am | Permalink

      Hi Stefan,

      That refers to the audio unit’s own internal buffer – it’s really quite a minor detail, but it saves a little memory allocation if you’re providing your own buffers instead. If in doubt, it’s safe to leave it out, though.

      Yep, you’ll get echo cancellation for free, as soon as you start using VPIO.

      You’re welcome =)

  13. StefanS
    Posted May 15, 2012 at 3:12 pm | Permalink

    Hello Michael,

    I finally got the chance to try my application on a device and not just the simulator. It all works perfectly, except a minor delay, which I will look into.

    My question: how can I output the audio through the speakers (the loud ones, so I can achieve a handsfree functionality)? So far, I can only hear the audio through the headphones or If I press my ear against the phone as in a standard conversation.

    As always, Thanks:) Stefan

  14. owen
    Posted May 15, 2012 at 4:37 pm | Permalink

    I have struggled to get this to work for a while now and it’s driving me nuts. However now I am kinda worried because after reading through the comments I see you posted: “Hey Rarejai – The ULaw format is for storage only, for use with things like the Audio File Services. Remote IO only works with PCM.” Which is what I’m trying to do (stream u-law audio) from the mic. I guess my question is, if that’s the case, then what would this do?

    AudioStreamBasicDescription audioFormat;
    audioFormat.mSampleRate = 8000.00; //44100.00;
    audioFormat.mFormatID = kAudioFormatULaw;
    //audioFormat.mFormatFlags = kAudioFormatFlagIsSignedInteger | kAudioFormatFlagIsPacked;
    audioFormat.mFramesPerPacket = 1;
    audioFormat.mChannelsPerFrame = 1;
    audioFormat.mBitsPerChannel = 16;
    audioFormat.mBytesPerPacket = 2;
    audioFormat.mBytesPerFrame = 2;

    I also have this question open http://stackoverflow.com/questions/10501236/stream-media-from-iphone because when I use a variation of this code, nothing happens:

    void audioDataReceiver (AudioBufferList bufferList) {
        double *q = (double *)(&bufferList)->mBuffers[0].mData;

        queue = [[NSOperationQueue alloc] init];

        for (int i = 0; i < strlen((const char *)(&bufferList)->mBuffers[0].mData); i++) {

            NSData *dataBuffer = [NSData dataWithBytes:&q[i] length:sizeof(double)];

            // NSData *dataBuffer = [NSData dataWithBytes:(&bufferList)->mBuffers[0].mData length:sizeof((&bufferList)->mBuffers[0].mData)];

            client = [AFHTTPClient clientWithBaseURL:[NSURL URLWithString:[NSString stringWithFormat:@"https://%@/", serverAddress]]];

            NSMutableDictionary *parameters = [NSMutableDictionary dictionary];

            NSMutableURLRequest *request = [client multipartFormRequestWithMethod:@"POST" path:[NSString stringWithFormat:@"transmitaudio?id=%@", sessionID] parameters:parameters constructingBodyWithBlock:^(id <AFMultipartFormData>formData)
            {
                // [formData appendPartWithFileData:self.audioHandler.dataBuffer name:@"micaudio" fileName:@"sound.caf" mimeType:@"audio/basic"];

                [formData appendData:dataBuffer];

                // NSLog(@"request: %@", request);
                // NSLog(@"client: %@", client);
            }];

            [request setValue:@"audio/basic" forHTTPHeaderField:@"content-type"];
            [request setValue:@"99999" forHTTPHeaderField:@"Content-Length"];
            [request setValue:@"Keep-Alive" forHTTPHeaderField:@"Connection"];
            [request setValue:@"no-cache" forHTTPHeaderField:@"Cache-Control"];

            AFHTTPRequestOperation *operation = [[AFHTTPRequestOperation alloc] initWithRequest:request];
            [queue addOperation:operation];

            // NSLog(@"queue: %@", queue);
        }
    }

  15. owen
    Posted May 15, 2012 at 4:41 pm | Permalink

    Sorry here is the pastebin for easier reading http://pastebin.com/hFSNnJct

  16. Philbo
    Posted July 25, 2012 at 6:18 pm | Permalink

    Why the AudioUnitRender function in the recordingCallback function? Don’t you already have access to ioData at that point which contains the samples from the mic?

    • Posted July 25, 2012 at 7:00 pm | Permalink

      Nope – the callback just notifies you that input is available. You still have to call AudioUnitRender to pull the actual audio from the audio unit.

  17. Joshua
    Posted August 11, 2012 at 9:27 pm | Permalink

    Hi Michael!

    Great stuff, wish I’d found this about a month ago when I started on my current project :P

    Basically I’ve created a retro video game sound creator for the iPad based on a similar open source pc/mac project.

    I’m using remoteIO, and everything’s working fine and dandy in the simulator…however, when I test on an iPad, it randomly reroutes the audio to the headphone jack.

    Is there any chance you know what is causing this, and what I could do about it?

    Thanks a ton in advance.

    • Posted August 11, 2012 at 9:32 pm | Permalink

      Hey Joshua,

      To my knowledge, there’s not currently a way on iOS < 6 to route audio to the speaker while headphones are plugged in. This will change in iOS 6, though.

      • Joshua
        Posted August 12, 2012 at 12:26 am | Permalink

        Hey, thanks for the response!

        I think I was a bit unclear in detailing my problem…basically there is NO headphone plugged in, but my program starts routing sound to the speaker (playing sounds normally) but then suddenly you can’t hear anything.

        When I say anything, I mean anything (onscreen keyboard clicks, etc)….When I plug my headphones in, it works perfectly though, and as soon as I take them out, the sound is rerouted to the speakers until my app malfunctions again. :/

        I’m using kaudiosessioncategory_mediaplayback, I don’t know if that might have anything to do with it?

        • Posted August 12, 2012 at 10:28 am | Permalink

          Hmm, I see! Well, I’m afraid I’ve never seen anything like that before. Perhaps try asking on the Core Audio mailing list.

  18. jnpdx
    Posted August 15, 2012 at 9:03 pm | Permalink

    I’m chasing my tail with a bug I’m hoping maybe you’ve run into —

    I have a couple users (statistically very few) that report that occasionally, they’ll stop getting audio output from my app, although all other apps work. They can force quit the app and it still doesn’t return. The only thing that makes the app make sound again is restarting the device. There’s a visual cue that only gets called during the audio callback, so it’s not that the unit has just stopped — it simply is no longer making sound.

    Unfortunately, it seems pretty random, and I’ve never been able to recreate the issue. Does this sound like anything you may have run into?

    • Posted August 15, 2012 at 9:07 pm | Permalink

      I’m afraid I haven’t seen this before, @jnpdx – I would strongly recommend posing this question to the folks on the Core Audio mailing list.

    • Posted August 15, 2012 at 9:21 pm | Permalink

      Oh, one thing to check: Make sure you’re handling audio interruptions properly! E.g. don’t stop your audio session on interruption; start your audio session when your app enters foreground and on interruption end notification. Also make sure you’re using the right audio session category.

      • jnpdx
        Posted August 20, 2012 at 7:13 am | Permalink

        Thanks — my category is kAudioSessionCategory_MediaPlayback, which I would think would be correct.

        The interruption listener calls AudioOutputUnitStop, but doesn’t kill the session — does that sound right?

        I just posted the question to the CoreAudio list as well. Thanks

        • Posted August 20, 2012 at 11:00 am | Permalink

          That does sound right indeed – I’m afraid I have no idea. Hopefully someone on the list will!

  19. Gaurav
    Posted September 1, 2012 at 7:11 am | Permalink

    Hi Michael,

    i am creating a MAC project in which the voice is recorded through Microphone ….. But the problem coming is that it is recorded at sample rate 44100 and i want to record the audio at 16000 sample rate .when i am using 16000 sample rate it is giving me error -10868 in AudioUnitSetProperty method…Please tell me how I can record it at 16000 sample rate and 16 bits.

    Please help me out i am totally stuck here

    Thanks, Gaurav

    • Posted September 1, 2012 at 12:31 pm | Permalink

      16000 Hz? Do you mean 16384 Hz? I’m not sure if the device can necessarily manage an odd number like 16000…

  20. Kevin
    Posted September 6, 2012 at 10:41 pm | Permalink

    Hi Mike,

    Thanks for posting all of this! I have a basic question: is the recording you’re describing (i.e. recordingCallback) for the in-app audio, or is it for the microphone?

    Thanks, Kevin

    • Posted September 6, 2012 at 10:44 pm | Permalink

      Hey Kevin – that’s microphone (or other audio input). For recording the audio your app produces, you’ll need to set a render callback on the output scope of the audio unit, and do something with the buffer.

      • Kevin
        Posted September 6, 2012 at 11:06 pm | Permalink

        Thanks for explaining that, Mike. Just to dig a little further: I’d like to record the application’s audio, but I haven’t built any of my own audio graphs (everything is done via OpenAL). Using RemoteIO won’t help here, right?

        • Posted September 7, 2012 at 2:52 pm | Permalink

          I don’t have any actual experience with OpenAL, unfortunately. If you have a way of accessing the Audio Unit, then you’re good to go – just add a render callback. Otherwise.. I’m afraid you’re on your own.

  21. Cameron
    Posted September 6, 2012 at 11:21 pm | Permalink

    What do you have to #import to make this work?
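
    For reference, the audio unit APIs used in the post live in the AudioToolbox/AudioUnit frameworks, so something like this (plus linking the frameworks in the target's build settings):

    ```c
    #include <AudioToolbox/AudioToolbox.h>  // AudioComponent*, AudioUnit*, AudioOutputUnit* APIs
    ```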

  22. hellosns
    Posted September 30, 2012 at 7:49 am | Permalink

    Hi Mike, thanks for posting this, it really helped me a lot. I have one problem: how do you resolve the feedback loop problem when using an external mic and speaker?

    thanks.

    • Posted September 30, 2012 at 11:06 am | Permalink

      No problem!

      Short answer is, you don’t. Longer answer is: Use headphones or a noise-discriminating mic. In a pinch, use the VPIO unit, but echo removal in general can do pretty nasty things to audio signals.

      • hellosns
        Posted September 30, 2012 at 3:31 pm | Permalink

        Thanks for the reply. The noise exists, so I have to solve the problem. :-) When using RemoteIO, I set the volume lower to remove the noise, but the effect was not good. When I used VPIO, it was like everything was done; the noise was gone. The official documentation points out that VPIO adds echo suppression for two-way duplex communication. Does that mean just setting the VPIO subtype can solve the problem?

        thanks again.

      • Posted September 30, 2012 at 5:01 pm | Permalink

        Yep, that’s right.

        • hellosns
          Posted October 1, 2012 at 1:36 pm | Permalink

          OK, thanks. I had seen a solution in a karaoke-like app that silences the voice from the microphone and sends the accompaniment MP3 through the speaker, so no feedback loop exists. The important thing is that the voice from the microphone and the accompaniment MP3 can be mixed correctly. I can’t work out how to implement this solution; maybe they didn’t use an audio unit. I wonder whether you have any ideas?

          thanks.

  23. Posted October 2, 2012 at 2:15 am | Permalink

    Hi Michael, first of all, a great post. An issue I am facing here is that I get audio output on the speaker when the RemoteIO device is not connected to a mic input. I am not setting any output callback, assuming I don’t need one. Everything works fine except that I get noise on the speaker if my mic input is not connected. I am pretty sure that I am not generating any system sound in the callback. Thanks, Manish

  24. Florian Bomers
    Posted October 4, 2012 at 3:49 pm | Permalink

    Hi, very nice article, I’ve used it a while ago as a starting point for an app. However, there is a nasty bug: you’re using AudioUnitUninitialize() to free the “audioUnit”. However, audioUnit is, in fact, an AudioComponentInstance, which must be freed using AudioComponentInstanceDispose(). Would be nice if you fixed your code in the article…

    The implication of this bug is that after first usage of the audio component, it stays in memory and occupies an audio channel. Any successive playback of audio (music, ringtone, speaker) will now be ducked in your or other applications. Single-instance audio units will fail to get initialized again. See also my stack overflow post: iOS 5/6: low volume after first usage of CoreAudio.

    • Posted October 4, 2012 at 4:39 pm | Permalink

      Thank you so much for chiming in, Florian – this wasn’t even on my radar. This actually very neatly explains the low volume oddness I was seeing recently when trying to switch between the VPIO unit and the RIO unit. Where were you a couple of months ago ;-)

      I’ve updated the post, and thanks again.
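
      The corrected teardown order, for anyone skimming the comments (per Florian's fix: `AudioUnitUninitialize` alone leaks the component instance, so it must be followed by `AudioComponentInstanceDispose`):

      ```c
      #include <AudioToolbox/AudioToolbox.h>

      void teardownAudioUnit(AudioComponentInstance audioUnit) {
          AudioOutputUnitStop(audioUnit);            // stop rendering
          AudioUnitUninitialize(audioUnit);          // release the unit's resources
          AudioComponentInstanceDispose(audioUnit);  // free the component instance itself
      }
      ```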

  25. Manish
    Posted October 15, 2012 at 11:15 pm | Permalink

    Can anyone please suggest the cause of this? I have created a universal app, and the weird noise is generated only in the iPad app. The noise starts whenever I start the Remote IO unit and stops when I stop it. This is all that’s happening in the code, apart from setting some buffers:

    - (void)startAudio {
        OSStatus startErr = noErr;
        startErr = AudioOutputUnitStart(remoteIOUnit);
        NSAssert(startErr == noErr, @"couldn't start Remote I/O unit");
        if (startErr == noErr) DebugLog(@"Audio started");
    }

    • Posted October 16, 2012 at 11:36 am | Permalink

      Thanks, Manish. Hmm – if you’re not generating any output, you probably need to set the OutputIsSilence flag in the callback, and maybe even memset the buffers to 0. My memory’s a bit fuzzy on that, though.
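
      A sketch of a do-nothing playback callback along those lines (zero the buffers and set the silence flag; this is a general pattern, not code from the post):

      ```c
      #include <AudioToolbox/AudioToolbox.h>
      #include <string.h>

      static OSStatus silentPlaybackCallback(void *inRefCon,
                                             AudioUnitRenderActionFlags *ioActionFlags,
                                             const AudioTimeStamp *inTimeStamp,
                                             UInt32 inBusNumber,
                                             UInt32 inNumberFrames,
                                             AudioBufferList *ioData) {
          // Zero every buffer so no stale memory reaches the speaker
          for ( UInt32 i = 0; i < ioData->mNumberBuffers; i++ ) {
              memset(ioData->mBuffers[i].mData, 0, ioData->mBuffers[i].mDataByteSize);
          }
          // Tell downstream processing the output is silent
          *ioActionFlags |= kAudioUnitRenderAction_OutputIsSilence;
          return noErr;
      }
      ```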

  26. Phil
    Posted October 23, 2012 at 3:32 am | Permalink

    I noticed the following comment above.

    “Notes: ioData contains buffers (may be more than one!) Fill them up as much as you can. Remember to set the size value in each buffer to match how much data is in the buffer.”

    Is this true, and how do you set the size value? I thought the buffer was allocated for you. Setting mDataByteSize doesn’t seem to do anything for me if the buffer is not completely full.

    Thanks.

  27. Pier
    Posted October 25, 2012 at 9:53 am | Permalink

    Thanks so much for this – it got me started on RemoteIO.

  28. Heeyoung
    Posted November 23, 2012 at 4:41 am | Permalink

    Hi Michael,

    Thanks for posting a great article.

    I’m working on a VoIP application and facing an issue with the audio. In some cases, when I change the audio route (speaker, headset, Bluetooth, …), mute/unmute, or hold/unhold the call quickly and then terminate the call, my thread gets stuck when it calls AudioOutputUnitStop and never comes back. Do you have any idea why AudioOutputUnitStop blocks forever, and how to unblock it?

    Thanks.

  29. Lantro
    Posted December 4, 2012 at 5:26 pm | Permalink

    Hello Michael.

    I need to play short sound clips, about 0.5 to 2 seconds, but not record. Is RemoteIO what I need, or is there another way to do it?

    AudioServices and AVFoundation are too slow for what I need. Cheers for such a great tutorial, it’s really helpful.

One Trackback

  1. By Again onwards | iOS adventures on October 4, 2012 at 4:12 pm

    […] from this guy until I realised we weren’t trying to do the same thing, looked a bit at Michael Tyson’s stuff and had a good read of the Apple docs on Audio Unit […]