Swift FFT - Problema de división compleja

Question

Oct 11, 2015, 05:04 PM

frequency swift fft ios signal-processing

Swift FFT - Problema de división compleja

Estoy tratando de realizarFFT en unaarchivo de audio para encontrar la frecuencia usando el marco Acelerar. He adaptado el código (probablemente incorrecto) de esta pregunta:Espectrograma de AVAudioPCMBuffer usando Accelerate framework en Swift

Aunque, las magnitudes de 'espectro$8Estoy tratando de realizar9$inf'o'yaya', y el 'real'y'imag'componentes de la división dividida imprimir resultados similares; indicando que esta es la causa del problema como: 'magnitud = sqrt (pow (real, 2) + pow (imag2)'. Corrígeme si estoy equivocado, pero creo que el resto del código está bien.

¿Por qué recibo estos resultados y cómo puedo solucionarlo (cuáles deberían ser los componentes divididos) y qué estoy haciendo mal? Tenga en cuenta que soy muy nuevo en FFT y el muestreo y no tengo idea de cómo configurar esto para un archivo de audio, por lo que cualquier ayuda sería muy apreciada. Gracias.

Aquí está el código que estoy usando:

    // get audio file
    let fileURL:NSURL = NSBundle.mainBundle().URLForResource("foo", withExtension: "mp3")!
    let audioFile = try!  AVAudioFile(forReading: fileURL)
    let fileFormat = audioFile.processingFormat
    let frameCount = UInt32(audioFile.length)

    let buffer = AVAudioPCMBuffer(PCMFormat: fileFormat, frameCapacity: frameCount)
    let audioEngine = AVAudioEngine()
    let playerNode = AVAudioPlayerNode()
    audioMixerNode = audioEngine.mainMixerNode

    let bufferSize = Int(frameCount)
    let channels: NSArray = [Int(buffer.format.channelCount)]
    let channelCount = channels.count
    let floats1 = [Int(buffer.frameLength)]
    for var i=0; i<channelCount; ++i {
        channelSamples.append([])
        let firstSample = buffer.format.interleaved ? i : i*bufferSize
        for var j=firstSample; j<bufferSize; j+=buffer.stride*2 {
            channelSamples[i].append(DSPComplex(real: buffer.floatChannelData.memory[j], imag: buffer.floatChannelData.memory[j+buffer.stride]))
        }
    }

    // connect node
    audioEngine.attachNode(playerNode)
    audioEngine.connect(playerNode, to: audioMixerNode, format: playerNode.outputFormatForBus(0))

    // Set up the transform
    let log2n = UInt(round(log2(Double(bufferSize))))
    let fftSetup = vDSP_create_fftsetup(log2n, Int32(kFFTRadix2))

    // Create the complex split value to hold the output of the transform
    // why doesn't this work?
    var realp = [Float](count: bufferSize/2, repeatedValue: 0)
    var imagp = [Float](count: bufferSize/2, repeatedValue: 0)
    var output = DSPSplitComplex(realp: &realp, imagp: &imagp)

    vDSP_ctoz(UnsafePointer(channelSamples), 2, &output, 1, UInt(bufferSize / 2))

    // Do the fast Fourier forward transform
    vDSP_fft_zrip(fftSetup, &output, 1, log2n, Int32(FFT_FORWARD))

    // Convert the complex output to magnitude
    var fft = [Float](count:Int(bufferSize / 2), repeatedValue:0.0)
    let bufferOver2: vDSP_Length = vDSP_Length(bufferSize / 2)
    vDSP_zvmags(&output, 1, &fft, 1, bufferOver2)

    var spectrum = [Float]()
    for var i=0; i<bufferSize/2; ++i {
        let imag = output.imagp[i]
        let real = output.realp[i]
        let magnitude = sqrt(pow(real,2)+pow(imag,2))
        spectrum.append(magnitude) }

    // Release the setup
    vDSP_destroy_fftsetup(fftSetup)