Why is the audio out of sync after converting?

Common causes: (1) variable framerate source rendered as constant (use -vsync vfr, or -fps_mode vfr in FFmpeg 5.1 and later, to preserve VFR); (2) different audio sample rates not resampled (add -ar 48000); (3) container limitations (MP4 handles variable framerate poorly; prefer MKV during editing and encode to MP4 only at the end). Always run ffprobe on both source and output to compare timing.

FLAC: Free Lossless Audio Codec — Technical Deep Dive

By Pablo Cirre • Updated May 7, 2026

FLAC (Free Lossless Audio Codec) was developed by Josh Coalson in 2001 and is maintained by the Xiph.Org Foundation. Unlike lossy codecs, which discard audio data, FLAC decodes to bit-identical audio: every sample you put in comes back out unchanged. On CD audio it typically produces files about 50–60% the size of the WAV original, roughly halving storage while preserving every sample.

Stream Structure

A FLAC stream begins with the 4-byte magic fLaC, followed by one or more metadata blocks, then a sequence of audio frames.

Metadata Blocks

Each metadata block has a 4-byte header: 1 bit is-last flag, 7-bit block type, 24-bit length. Types:

Type	Decimal	Description
STREAMINFO	0	Mandatory. Sample rate, channels, bits per sample, total samples, MD5
PADDING	1	Reserved space for tag updates without rewriting
APPLICATION	2	Application-specific data (third-party use)
SEEKTABLE	3	Seek point table for fast random access
VORBIS_COMMENT	4	UTF-8 key=value tags (artist, title, album, etc.)
CUESHEET	5	CD layout with track offsets and ISRCs
PICTURE	6	Embedded cover art (picture type codes follow ID3v2 APIC)

The STREAMINFO block (34 bytes) packs the following fields bit by bit: minimum and maximum block size (16 bits each), minimum and maximum frame size (24 bits each), sample rate (20 bits; the original format documentation limits it to 655,350 Hz), number of channels minus one (3 bits, up to 8 channels), bits per sample minus one (5 bits, 4–32 bps), total samples (36 bits), and a 128-bit MD5 signature of the unencoded audio.
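
The field widths listed above add up to 272 bits, which is where the 34-byte size comes from. As a minimal sketch of the packing, assuming MSB-first field order as listed (the helper name pack_streaminfo and the sample values are illustrative and not part of any FLAC library):

```python
# Field widths in bits, in stream order, as listed in the paragraph above.
STREAMINFO_FIELDS = [
    ("min_block_size", 16),
    ("max_block_size", 16),
    ("min_frame_size", 24),
    ("max_frame_size", 24),
    ("sample_rate", 20),
    ("channels_minus_1", 3),
    ("bps_minus_1", 5),
    ("total_samples", 36),
    ("md5", 128),
]


def pack_streaminfo(values: dict) -> bytes:
    """Pack STREAMINFO fields MSB-first into one block (hypothetical helper)."""
    acc = 0
    total_bits = 0
    for name, width in STREAMINFO_FIELDS:
        value = values[name]
        if not 0 <= value < (1 << width):
            raise ValueError(f"{name}={value} does not fit in {width} bits")
        acc = (acc << width) | value
        total_bits += width
    return acc.to_bytes(total_bits // 8, "big")


# Illustrative values for a CD-quality stereo stream.
block = pack_streaminfo({
    "min_block_size": 4096,
    "max_block_size": 4096,
    "min_frame_size": 0,       # assumption: 0 used here as "not known"
    "max_frame_size": 0,
    "sample_rate": 44100,
    "channels_minus_1": 1,     # stereo
    "bps_minus_1": 15,         # 16-bit
    "total_samples": 14495700,
    "md5": 0,                  # assumption: signature left unset
})
print(len(block))  # 34
```

Packing through a single big integer avoids hand-written shifts for the fields that straddle byte boundaries (sample rate, channels, bits per sample, and the top four bits of the sample count).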

Audio Frame Structure

Each FLAC frame contains:

Frame header: 14-bit sync code (the first two bytes read 0xFF 0xF8 for fixed block size or 0xFF 0xF9 for variable block size), blocking strategy, block size, sample rate, channel assignment, sample size, frame/sample number, header CRC-8
Subframes: one per channel, each with its own subframe type and encoding
Frame footer: CRC-16 of the entire frame
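
The sync code and the two checksums are what let a decoder find and validate frames. The sketch below uses the polynomials given in the FLAC format documentation (CRC-8: x^8 + x^2 + x + 1; CRC-16: x^16 + x^15 + x^2 + 1; both start at 0). The function names are illustrative, and the bit-by-bit loops favour clarity over speed:

```python
def crc8_flac(data: bytes) -> int:
    """CRC-8 over the frame header bytes (polynomial 0x07, initial value 0)."""
    crc = 0
    for byte in data:
        crc ^= byte
        for _ in range(8):
            if crc & 0x80:
                crc = ((crc << 1) ^ 0x07) & 0xFF
            else:
                crc = (crc << 1) & 0xFF
    return crc


def crc16_flac(data: bytes) -> int:
    """CRC-16 over the whole frame (polynomial 0x8005, initial value 0)."""
    crc = 0
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            if crc & 0x8000:
                crc = ((crc << 1) ^ 0x8005) & 0xFFFF
            else:
                crc = (crc << 1) & 0xFFFF
    return crc


def looks_like_frame_start(data: bytes) -> bool:
    """True if the 14-bit sync code is present and the reserved bit is 0."""
    if len(data) < 2:
        return False
    word = int.from_bytes(data[:2], "big")
    return (word >> 2) == 0b11111111111110 and (word & 0b10) == 0


print(looks_like_frame_start(bytes([0xFF, 0xF8])))  # True
print(looks_like_frame_start(bytes([0xFF, 0xFB])))  # False (reserved bit set)
```

A sync match alone is not proof of a frame boundary, since the same bit pattern can occur inside audio data; checking the header CRC-8 is the usual way to reject false positives.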

Subframe Encoding Types

FLAC's compression core lies in the subframe predictor, which exploits temporal correlation:

Type	Code	Description
SUBFRAME_CONSTANT	0	All samples identical — stores only one value
SUBFRAME_VERBATIM	1	Uncompressed raw samples — fallback when prediction fails
SUBFRAME_FIXED	8–12	Polynomial predictor, order 0–4, fixed coefficients
SUBFRAME_LPC	32–63	FIR linear predictor, order 1–32, quantised coefficients

For FIXED subframes, the predictor order selects pre-defined integer coefficients. For LPC subframes, the encoder calculates optimal real-valued FIR coefficients via autocorrelation (Levinson-Durbin algorithm), then quantises them to integer precision. Higher orders model more complex patterns but add coefficient overhead.
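
To make the FIXED case concrete, here is a small sketch of the five fixed predictors. The order-k residual is simply the k-th finite difference of the signal, and the first k samples are kept as warm-up values. The function names are illustrative; a real encoder would also choose the order that yields the cheapest residual.

```python
# Prediction coefficients applied to s[n-1], s[n-2], ... for each fixed order.
FIXED_COEFFS = {
    0: [],
    1: [1],
    2: [2, -1],
    3: [3, -3, 1],
    4: [4, -6, 4, -1],
}


def fixed_residual(samples: list[int], order: int) -> list[int]:
    """Residuals left after predicting each sample from the previous `order` samples."""
    coeffs = FIXED_COEFFS[order]
    residual = []
    for n in range(order, len(samples)):
        prediction = sum(c * samples[n - 1 - i] for i, c in enumerate(coeffs))
        residual.append(samples[n] - prediction)
    return residual


def fixed_restore(warmup: list[int], residual: list[int], order: int) -> list[int]:
    """Invert fixed_residual: rebuild the samples from warm-up values and residuals."""
    coeffs = FIXED_COEFFS[order]
    out = list(warmup)
    for r in residual:
        prediction = sum(c * out[-1 - i] for i, c in enumerate(coeffs))
        out.append(prediction + r)
    return out


ramp = [10, 13, 16, 19, 22]          # a straight line
print(fixed_residual(ramp, 1))        # [3, 3, 3, 3]
print(fixed_residual(ramp, 2))        # [0, 0, 0]
print(fixed_restore(ramp[:2], fixed_residual(ramp, 2), 2) == ramp)  # True
```

A linear ramp is predicted perfectly at order 2, so only the two warm-up samples carry information. Small residuals like these are what make the entropy-coding stage effective.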

After prediction, the residuals (prediction errors) are encoded with Golomb-Rice coding — a near-optimal entropy coder for Laplacian-distributed residuals. The encoder partitions residuals into segments, selecting the Rice parameter that minimises bits.
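
A rough sketch of the Rice stage follows. Signed residuals are first folded to non-negative integers, then each value is written as a unary quotient (zeros terminated by a one) followed by k low bits. The exhaustive parameter search and the upper bound of 14 are simplifying assumptions for illustration; real encoders estimate the parameter per partition rather than trying every value.

```python
def zigzag(value: int) -> int:
    """Fold signed residuals to non-negative ints: 0, -1, 1, -2 -> 0, 1, 2, 3."""
    return 2 * value if value >= 0 else -2 * value - 1


def rice_encode(value: int, k: int) -> str:
    """Rice code of one residual as a bit string: unary quotient, then k low bits."""
    u = zigzag(value)
    quotient = u >> k
    remainder = u & ((1 << k) - 1)
    low_bits = format(remainder, f"0{k}b") if k else ""
    return "0" * quotient + "1" + low_bits


def rice_cost(residuals: list[int], k: int) -> int:
    """Total number of bits needed to code the residuals with parameter k."""
    return sum((zigzag(r) >> k) + 1 + k for r in residuals)


def best_rice_parameter(residuals: list[int], max_k: int = 14) -> int:
    """Brute-force search for the parameter that minimises the coded size."""
    return min(range(max_k + 1), key=lambda k: rice_cost(residuals, k))


print(rice_encode(5, 2))                              # 00110
print(best_rice_parameter([0, 0, 0, 0]))              # 0
print(best_rice_parameter([100, -120, 90, -110]))     # 7
```

Small residuals favour a small k (short codes), while large residuals favour a larger k (fewer unary bits). This is why partitioning a block and choosing k per partition helps when the signal's energy varies within the block.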

Channel Coupling

Stereo files can use decorrelated stereo modes to improve compression when L and R channels are correlated (as in most music):

Independent: L and R encoded separately
Mid-side: encodes mid = (L+R)>>1 and side = L−R (the side channel is not halved, which keeps the transform lossless); usually compresses better when the channels are similar
Left-side: stores L and the difference L−R
Right-side: stores R and the difference L−R

The encoder compares all applicable modes and selects the one producing the smallest frame.
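
The mid-side transform stays lossless even though the mid channel drops a bit in the division, because the parity of L+R always equals the parity of L−R. A minimal sketch of the round trip, with illustrative function names:

```python
def encode_mid_side(left: list[int], right: list[int]) -> tuple[list[int], list[int]]:
    """Convert L/R samples to mid/side; mid loses its lowest bit, side keeps everything."""
    mid = [(l + r) >> 1 for l, r in zip(left, right)]
    side = [l - r for l, r in zip(left, right)]
    return mid, side


def decode_mid_side(mid: list[int], side: list[int]) -> tuple[list[int], list[int]]:
    """Recover L/R exactly, restoring the dropped bit from the parity of side."""
    left, right = [], []
    for m, s in zip(mid, side):
        total = (m << 1) | (s & 1)    # equals the original L + R
        left.append((total + s) >> 1)
        right.append((total - s) >> 1)
    return left, right


left = [3, -3, 0, 100, -32768]
right = [-2, 2, 0, 99, 32767]
mid, side = encode_mid_side(left, right)
print(decode_mid_side(mid, side) == (left, right))  # True
```

The side channel needs one extra bit of range compared with the inputs (for 16-bit audio it spans −65535 to 65535), which is why a decoder has to treat it as one bit wider than the other channel.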

Compression Levels

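FLAC compression levels (-0 to -8) trade encode speed for file size by adjusting predictor order, block size, and search effort. The parameters below are the presets documented for recent releases of the reference flac encoder; other encoders may map levels differently:

Level	Block size	Max LPC order	Max Rice partition order	Typical ratio	Speed
-0	1152	0 (fixed predictors only)	3	~63% of WAV	Fastest
-1	1152	0 (fixed predictors only)	3	~61%	Very fast
-2	1152	0 (fixed predictors only)	3	~59%	Fast
-4	4096	8	4	~57%	Medium
-5	4096	8	5	~55%	Default
-6	4096	8	6	~54%	Slow
-8	4096	12	6	~53%	Slowest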

Going from -0 to -8 typically shrinks files by only 5–10% while taking roughly 10× as long to encode. For archival purposes, -8 is worth it; for real-time encoding, -0 or -2 suffices.

FFmpeg FLAC Encoding

# WAV to FLAC, maximum compression, keep all tags
ffmpeg -i input.wav -c:a flac -compression_level 8 output.flac

# CD rip (16-bit, 44.1 kHz) upsampled to 24-bit, 96 kHz FLAC
# Note: upsampling does not add quality; use only to match a target format
ffmpeg -i input.wav -c:a flac -ar 96000 -sample_fmt s32 output_24bit.flac

# Batch convert MP3 to FLAC (no quality gain: the result is capped at the quality of the source MP3)
for f in *.mp3; do ffmpeg -i "$f" -c:a flac "${f%.mp3}.flac"; done

# Embed cover art into existing FLAC
ffmpeg -i audio.flac -i cover.jpg \
  -map 0:a -map 1:v \
  -c:a copy -c:v copy \
  -metadata:s:v comment="Cover (front)" \
  -disposition:v attached_pic \
  with_cover.flac

# Extract embedded cover art
ffmpeg -i audio.flac -an -vcodec copy cover_extracted.jpg

Python: Reading and Writing FLAC

import soundfile as sf
import numpy as np

# Read FLAC — returns float64 array by default
samples, rate = sf.read('track.flac')
print(f"Rate: {rate} Hz | Shape: {samples.shape} | dtype: {samples.dtype}")

# Write lossless 24-bit FLAC
sf.write('output_24bit.flac', samples, rate, subtype='PCM_24')

# FLAC subtypes offered by libsndfile: PCM_S8, PCM_16, PCM_24 (check with sf.available_subtypes('FLAC'))

# ---- Tag manipulation with mutagen ----
from mutagen.flac import FLAC

audio = FLAC('track.flac')

# Read stream info
print(f"Sample rate : {audio.info.sample_rate} Hz")
print(f"Channels    : {audio.info.channels}")
print(f"Bits/sample : {audio.info.bits_per_sample}")
print(f"Duration    : {audio.info.length:.2f} s")

# Read / write VorbisComment tags
print("Title:", audio.get('title', ['(none)'])[0])
audio['artist'] = ['New Artist Name']
audio['album']  = ['Remastered Edition 2025']
audio.save()

Parsing the FLAC File Structure in Python

import struct

def read_flac_metadata(path: str) -> dict:
    info = {}
    with open(path, 'rb') as f:
        magic = f.read(4)
        if magic != b'fLaC':
            raise ValueError(f"Not a FLAC file: {magic!r}")

        while True:
            hdr = f.read(4)
            if len(hdr) < 4:
                break
            is_last   = bool(hdr[0] & 0x80)
            block_type = hdr[0] & 0x7F
            length    = struct.unpack('>I', b'\x00' + hdr[1:])[0]
            data      = f.read(length)

            if block_type == 0:  # STREAMINFO
                # Unpack the 34-byte STREAMINFO block
                min_bs, max_bs = struct.unpack('>HH', data[0:4])
                # min/max frame size stored as 24-bit big-endian
                min_fs = int.from_bytes(data[4:7], 'big')
                max_fs = int.from_bytes(data[7:10], 'big')
                # sample_rate(20) | channels(3) | bps(5) packed in 4 bytes starting at offset 10
                packed = int.from_bytes(data[10:14], 'big')
                info['sample_rate'] = (packed >> 12) & 0xFFFFF
                info['channels']    = ((packed >> 9) & 0x7) + 1
                info['bps']         = ((packed >> 4) & 0x1F) + 1
                # total_samples: bottom 4 bits of data[13] + data[14:18] = 36 bits
                total = ((data[13] & 0x0F) << 32) | int.from_bytes(data[14:18], 'big')
                info['total_samples'] = total
                info['md5'] = data[18:34].hex()
            elif block_type == 4:  # VORBIS_COMMENT
                # Length-prefixed UTF-8 strings
                vendor_len = struct.unpack('<I', data[0:4])[0]
                vendor = data[4:4+vendor_len].decode('utf-8', errors='replace')
                info['vendor'] = vendor

            if is_last:
                break

    return info

meta = read_flac_metadata('track.flac')
print(meta)
# {'sample_rate': 44100, 'channels': 2, 'bps': 16, 'total_samples': 14495700, 'md5': '...', 'vendor': 'reference libFLAC 1.4.3 20230624'}

FLAC vs Other Lossless Formats

Format	Lossless	Typical ratio	Container	Streaming	OS support
FLAC	Yes	~55% of WAV	Native / OGG	HTTP range	Linux/Win/Mac (native)
ALAC	Yes	~60% of WAV	MP4/M4A	HLS	Apple ecosystem native
WAV	Yes (PCM)	100%	RIFF	Limited	Universal
AIFF	Yes (PCM)	100%	IFF	Limited	Apple native
WavPack	Yes (+lossy hybrid)	~55%	.wv	No	Limited
APE (Monkey's)	Yes	~50%	.ape	No	Limited hardware support

FLAC is the best default lossless format outside the Apple ecosystem: it is royalty-free and free of known patent claims, widely supported in hardware (DAPs, NAS devices, receivers), and carries rich metadata via Vorbis comments.

Common Pitfalls

Converting lossy → FLAC does not restore quality. FLAC is lossless — it preserves exactly what it receives. If the source MP3 has artefacts, the FLAC will contain those same artefacts. The only benefit is a lossless container for future re-processing without additional generation loss.
24-bit FLAC from a 16-bit source adds no information. Increasing the bit depth only zero-pads the low bits. The sound is identical, and the file is no smaller and may be larger.
FLAC is not natively streamable. Unlike HLS/DASH with ALAC-in-MP4, FLAC requires HTTP range requests or a custom player. For lossless Apple Music streaming, Apple uses ALAC in fragmented MP4.
Tag the ALBUM ARTIST field when building music libraries. Many players and DAPs use albumartist for compilation grouping, not artist.
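
The bit-depth pitfall above can be checked with plain integers. The sketch below assumes the simplest form of promotion, a left shift by 8 bits with no dither or gain change, and uses illustrative helper names:

```python
def pad_16_to_24(samples_16: list[int]) -> list[int]:
    """Promote 16-bit samples to 24-bit by shifting left 8 bits (low byte is zero)."""
    return [s << 8 for s in samples_16]


def truncate_24_to_16(samples_24: list[int]) -> list[int]:
    """Drop the low byte again."""
    return [s >> 8 for s in samples_24]


original = [0, 1, -1, 12345, -32768, 32767]
padded = pad_16_to_24(original)

# Every padded sample has an all-zero low byte, so no new detail was created.
print(all(s & 0xFF == 0 for s in padded))        # True
# Going back to 16 bits recovers the source exactly.
print(truncate_24_to_16(padded) == original)     # True
```

The padded stream still contains only the values the 16-bit source could express, just spread over a wider numeric range.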

Frequently Asked Questions

Does converting MP3 to FLAC improve audio quality?

No. FLAC is lossless: it stores and retrieves samples exactly as given. Converting an MP3 to FLAC does not recover the frequencies discarded during MP3 encoding. The resulting file will be larger and in a lossless container, but the audio fidelity is identical to the MP3 source. The only practical reason to do this conversion is to avoid additional generation loss in future edits: processing a FLAC rather than an MP3 prevents re-encoding artefacts from accumulating.

Which video codec should I choose: H.264, H.265, or AV1?

AV1 is the most efficient (royalty-free, ~30% smaller than H.265) but encoding is slow. H.265 (HEVC) saves ~30–50% over H.264 and is supported by every modern phone and desktop. H.264 remains the safest baseline for legacy compatibility. Rule of thumb: archives → AV1, daily use → H.265, broadest reach → H.264.

Which FLAC compression level should I use?

Level 5 (the default in most encoders) is the best balance for almost all cases: it delivers roughly 55% of the original WAV size with moderate encode speed. Use level 8 for archival storage where you want maximum space savings and can afford slow encoding. Use level 0 or 1 for real-time capture (recording audio on the fly) or when processing throughput matters more than file size. The quality of the decoded audio is identical regardless of compression level; only the file size and encode time differ.

Should I use CRF, two-pass, or constant bitrate encoding?

CRF (Constant Rate Factor) is the best default for offline files: ffmpeg picks the bitrate frame by frame to maintain perceived quality. Two-pass is only better when you must hit an exact final size (DVD targets). Constant bitrate is for streaming over a fixed-bandwidth channel. For "smallest at quality X" always use CRF.

Should I use FLAC or ALAC?

ALAC (Apple Lossless) is natively supported by iOS, macOS, tvOS, watchOS, and AirPlay without any additional software, including for Apple Music streaming. If your primary ecosystem is Apple (iPhone, iPad, Apple TV, HomePod), ALAC in an M4A container is the path of least resistance. FLAC is better for cross-platform libraries, NAS-based music servers (Navidrome, Jellyfin), Android devices, and high-end DAPs (Fiio, Astell&Kern). Most modern Apple devices have also supported FLAC natively since iOS 11 / macOS 10.13, so the gap has narrowed significantly.

How much larger is FLAC than MP3?

A typical 4-minute CD-quality track (44.1 kHz, 16-bit stereo) stored as WAV is about 40 MB. As FLAC at the default compression level, it compresses to roughly 22–28 MB (55–70% of WAV). As MP3 at 320 kbps, the same track is about 10 MB; at 128 kbps it is about 4 MB. So FLAC is roughly 2.5–7× larger than MP3 for the same content. For a 1,000-track music library, that works out to roughly 22–28 GB as FLAC vs ~10 GB as MP3 at 320 kbps vs ~4 GB as MP3 at 128 kbps.
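
The per-track figures above follow from simple arithmetic, sketched below. The helper names are illustrative, container and header overhead is ignored, and the 55–70% FLAC ratio is the estimate quoted in the text rather than a measured value:

```python
def pcm_bytes(seconds: int, sample_rate: int = 44100, bits: int = 16, channels: int = 2) -> int:
    """Size of raw PCM audio in bytes (WAV header ignored)."""
    return seconds * sample_rate * (bits // 8) * channels


def cbr_bytes(seconds: int, kbps: int) -> int:
    """Size of a constant-bitrate stream in bytes."""
    return seconds * kbps * 1000 // 8


track_seconds = 4 * 60
wav = pcm_bytes(track_seconds)

MB = 1_000_000
print(f"WAV          : {wav / MB:.1f} MB")
print(f"FLAC (55-70%): {0.55 * wav / MB:.1f}-{0.70 * wav / MB:.1f} MB")
print(f"MP3 320 kbps : {cbr_bytes(track_seconds, 320) / MB:.1f} MB")
print(f"MP3 128 kbps : {cbr_bytes(track_seconds, 128) / MB:.1f} MB")
```

Multiplying any of these per-track sizes by the number of tracks gives the library total; 1,000 four-minute tracks simply turn megabytes into gigabytes.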

Can I convert between containers without re-encoding?

Yes, if you only change the container: ffmpeg -i in.mkv -c copy out.mp4. This remuxes the streams without re-encoding and takes seconds even for hours of footage. Limitations: the codec must be supported by the target container (e.g. you cannot put H.264 in WebM, only VP8/VP9/AV1). To shrink the file you must re-encode.