Dvinfo

Introduction

dvinfo is a command‑line utility designed to extract metadata from files encoded in the Digital Video (DV) format. DV is a professional digital video recording format that became popular in the 1990s for consumer and professional use. The dvinfo program reports a wide range of technical details about a DV file, including resolution, frame rate, field order, color space, and other attributes that describe how the video data is stored. The tool is commonly bundled with the dv collection of programs, which also includes dvconvert, dvplayback, dvrec, and others. dvinfo is available on many Unix‑like operating systems, including Linux, FreeBSD, and macOS, and it is distributed under the GNU Lesser General Public License.

The primary purpose of dvinfo is to provide developers, video editors, archivists, and forensic analysts with a quick way to verify the properties of a DV file without requiring a full playback or conversion process. By inspecting the file header and frame headers, dvinfo can determine whether a file is interlaced or progressive, whether it follows the standard 4:2:2 chroma subsampling, and whether it uses standard or non‑standard frame rates. These characteristics are essential for workflow decisions, such as selecting the appropriate encoding pipeline or determining the compatibility of a file with legacy hardware.

Because DV files are typically large (approximately 100 MB per minute for uncompressed 4:1:1 video), and because many archival workflows involve large collections of such files, dvinfo’s lightweight, non‑destructive inspection is a valuable part of the video asset management process.

History and Background

Digital Video Format

Digital Video was introduced by Sony and Panasonic in 1991 as an analog video signal captured and stored in a digital representation. The format was designed to replace analog VHS and Betacam tapes, offering higher quality and easier editing. DV records video at either 29.97 frames per second (NTSC) or 25 frames per second (PAL), using 4:1:1 chroma subsampling for the main data stream and a 4:2:2 subsampling for the audio data. Each frame consists of two fields, each containing 576 or 576 lines, depending on the video standard, and the fields are interlaced in the default configuration.

The DV file format uses a straightforward packetized structure. A file begins with a 1,048,576‑byte header that contains metadata such as the number of frames, frame rate, and other descriptive information. Following the header, the file is organized into 10,000‑byte chunks, each containing the header for a single frame and the frame’s image data. Because of this regular structure, programs that process DV files can read the file sequentially without needing to decompress the entire video.

Development of the dv Suite

The dv suite of tools emerged in the mid‑1990s as a set of open‑source applications that could capture, convert, and play back DV video on Linux. The initial implementation was written by several developers in the FreeBSD community, and later contributions were merged into the GNU project. The suite was released under the GNU Lesser General Public License, which allowed integration with proprietary software and encouraged broad adoption in academic and professional environments.

dvinfo was added to the suite as a lightweight diagnostic tool. Its source code is written in C and relies on the libdv library, which provides functions for reading and writing DV frames. The dvinfo program uses the libdv API to parse the header of a DV file and to iterate over frame headers, collecting statistics about the video stream. The program was designed to be efficient, requiring only a small fraction of the file to determine key properties, but it also supports full‑file scans to generate a more detailed report.

While DV was dominant in the 1990s, the introduction of compressed formats such as DVCPRO, AVCHD, and later XAVC marked a shift toward higher compression ratios and improved color fidelity. Nevertheless, DV remains in use, particularly in archival contexts where uncompressed or minimally compressed video is required for preservation. The dv suite of tools has remained relevant because it can handle a variety of DV derivatives, including DVCPRO and DVCPRO50, which differ mainly in color subsampling and sample depth.

In addition to the traditional DV format, the Linux community developed several libraries and tools that interface with the Video4Linux API for capturing DV from hardware devices. The dvrec program, for instance, uses V4L2 to record directly from a DV capture card, while dvplayback streams DV data over the network. dvinfo complements these programs by providing metadata extraction without the need to access hardware.

Key Concepts

DV File Structure

At the core of dvinfo’s operation is a precise understanding of the DV file layout. A standard DV file can be described as follows:

File header (1,048,576 bytes): contains the global metadata such as frame count, frame rate, and stream format.
Frame chunk (10,000 bytes): each chunk comprises a 40‑byte frame header and a 9,960‑byte frame payload.
Frame header fields: include timecode, drop‑frame indicator, field polarity, and other flags.
Frame payload: consists of the actual video data, stored in 4:1:1 chroma subsampling for the main field and 4:2:2 for the audio field.

The header and frame header structures are defined by the SMPTE 259M standard, and the dv library implements these structures directly. Because the header is always the same size, dvinfo can calculate the number of frames by dividing the file size minus the header size by the frame chunk size.

Interlacing and Field Order

DV captures video in interlaced mode by default. Interlacing means that each frame is composed of two fields: the even lines and the odd lines, captured at slightly different times. The field order can be either top‑first or bottom‑first, and this information is encoded in the frame header. The field polarity flag indicates whether a frame is even or odd. The drop‑frame flag indicates whether a frame has been dropped to maintain synchronization with real‑time playback.

dvinfo reports the field order and interlacing status in its output. This information is crucial for editors who need to deinterlace footage or who wish to preserve the original field structure for timecode alignment.

Color Sampling and Bit Depth

Standard DV uses 4:1:1 chroma subsampling for the primary video field and 4:2:2 for the audio field. In other words, for every four luma samples there is one chroma sample in the first field, and two chroma samples in the second field. DVCPRO uses 4:2:2 subsampling for the entire frame. The bit depth in standard DV is 10 bits per component, which is represented as 12 bits in the DV packet with a bias of 512. DVCPRO extends this to 12 bits per component. dvinfo displays the color sampling format and bit depth for the file, helping users determine whether a file can be used in a particular workflow.

Frame Rate and Timecode

The DV file header contains the frame rate identifier. Common values are 29.97 (NTSC drop‑frame), 29.97 (NTSC non‑drop‑frame), 25 (PAL), and 23.98 (film). dvinfo reports the frame rate in frames per second and indicates whether drop‑frame is used. Timecode information is present in each frame header, providing a way to locate specific frames accurately. The tool prints the starting timecode and the timecode increment per frame, allowing users to calculate the exact position of any frame in the file.

Hardware Support and Capture Devices

DV is typically captured using a DV capture card connected via BNC or RS‑422 interfaces. Many of these cards are supported by the Video4Linux API. The dv suite provides integration with these devices, but dvinfo itself does not require direct access to hardware; it operates purely on file data. However, understanding the capture settings - such as field polarity, resolution, and audio sample rate - helps users interpret dvinfo’s output in the context of the capture workflow.

Command‑Line Interface

General Usage

The dvinfo command accepts one or more file paths as arguments. Its syntax is:

dvinfo [options] file1 [file2 ...]

The program processes each file sequentially, printing a report for each. If no files are specified, dvinfo prints its usage message.

Options

-a or --all: Scan all frames of the file and report statistics such as average luminance, color distribution, and field polarity variations. By default, dvinfo reports only header information.
-c or --compact: Output a single line per file with key values, suitable for scripting and batch processing.
-d or --debug: Enable verbose debugging output, including raw header bytes. This is useful for developers investigating non‑standard DV files.
-f or --frame n: Inspect only frame number n. Frame numbering starts at 0. This option allows quick retrieval of a specific frame’s header data without scanning the entire file.
-h or --help: Display help information and exit.
-v or --version: Print the program’s version number.

Output Format

When invoked without the -c option, dvinfo prints a human‑readable report. The report includes the following sections:

File header: Shows the file size, frame count, frame rate, field polarity, timecode, and other header values.
Frame summary: Lists the number of frames, the average timecode, and whether the file is interlaced.
Color and bit depth: Reports chroma subsampling and bit depth.
Statistics (if -a is specified): Provides averages of luminance, saturation, and field polarity occurrences.

With the -c option, the output is condensed into a single line containing comma‑separated values: file name, size, frame count, frame rate, field polarity, timecode start, timecode end, and color format.

Applications

Video Editing and Post‑Production

Video editors often need to know the technical properties of source footage before ingesting it into a non‑linear editing system. dvinfo allows editors to verify that a DV file matches the required format - particularly the resolution (720×480 for NTSC, 720×576 for PAL) and the frame rate. If a file is interlaced, editors can decide whether to deinterlace or to use an interlaced editing workflow. Additionally, knowing the chroma subsampling informs the editor about potential color grading limitations.

Digital Archiving and Preservation

Archivists who maintain long‑term collections of DV footage rely on dvinfo to confirm that files have not become corrupted or altered. The tool can detect missing frames, mismatched timecode, or unexpected changes in field polarity. By generating a database of dvinfo reports, archivists can maintain a checksum and metadata catalog that facilitates audits and migration planning.

Forensic Video Analysis

Law enforcement agencies sometimes examine DV footage to corroborate evidence. dvinfo provides forensic analysts with critical information about the recording parameters, such as frame rate and drop‑frame status. The timecode information can be used to cross‑reference timestamps in other logs. In some cases, analysts use dvinfo as part of a chain of custody protocol, recording the metadata before any manipulation.

Education and Research

Academic researchers studying video compression, signal processing, or digital media often work with DV data sets. dvinfo offers a lightweight way to extract baseline characteristics from large data sets, enabling the researchers to focus on algorithmic development rather than manual metadata collection.

Software Development and Integration

Developers building video processing pipelines frequently use dvinfo to validate input files. Because dvinfo is command‑line based, it can be invoked from scripts or integrated into continuous‑integration environments. The output is easily parsed by programs written in Python, Bash, or other languages, allowing automated workflows to react to specific properties, such as rejecting a file if it is not 720×480.

Performance Considerations

Memory Footprint

dvinfo is a streaming application that reads only the necessary portions of a file at a time. The default mode reads the 1‑MB header and then processes each frame header on demand. Even when the -a option is used, dvinfo allocates a small buffer per frame, typically 10 KB, keeping the memory usage below 2 MB for most files.

Execution Time

The time taken by dvinfo depends largely on the number of frames to be scanned. For a typical 10‑minute DV file (~100 MB) containing 1,800 frames, the default header‑only mode completes in under 0.5 seconds on a modern processor. When full scanning is requested, the tool may take 1–2 seconds depending on disk I/O speed. The algorithmic complexity is linear in the number of frames, as dvinfo must read each frame header once.

I/O Characteristics

dvinfo performs sequential reads, which aligns well with the access patterns of spinning disks and solid‑state drives. The tool does not attempt to random‑access frames, so the I/O overhead is minimal. On systems where the file is stored on a network file system, the sequential read behavior can also be advantageous by reducing seek times.

Compatibility and Limitations

Supported Formats

dvinfo is designed to work with files that conform to the SMPTE 259M specification. It can handle standard DV, DVCPRO, DVCPRO50, and certain variants that use 4:2:2 chroma subsampling. The tool attempts to detect the format by inspecting the frame header and the chroma subsampling indicator. If a file uses a non‑standard format - such as a proprietary format used by a specific camera - dvinfo may report an error or produce incomplete metadata.

Error Handling

If dvinfo encounters a file that is too small to contain a valid header, it prints an error message: “Invalid DV file: header too short.” The tool also checks that the file size minus header size is divisible by the frame chunk size. If not, it reports an error: “File size inconsistent with frame count.” These checks help detect truncated or corrupted files.

Non‑Standard Timecode

Some DV files contain timecodes that do not start at 00:00:00:00, or that use a different timecode base. dvinfo prints the timecode base as read from the header. If the timecode does not align with the expected base, dvinfo reports a warning. However, the tool does not validate that the timecode is globally unique across the file; such validation is left to higher‑level tools.

Future Extensions

Potential future enhancements for dvinfo include support for higher‑bit‑depth DV variants, more detailed color statistics, and integration with the newer SMPTE 428‑2 standard for higher‑resolution DV. However, these extensions would require updates to the underlying dv library and potentially changes to the command‑line interface.

Example Workflow

Suppose a production house needs to ingest dozens of DV files into a video editing system that requires 720×480 NTSC footage at 29.97 non‑drop‑frame. The following Bash script demonstrates how dvinfo can be used to filter out unsuitable files:

#!/usr/bin/env bash
set -euo pipefail

for file in "$@"; do
    read -r name size frames rate field polarity start end format

In this script, dvinfo’s compact output is parsed to ensure the file meets the required format before further processing.

Development and Contribution

Source Code

The dvinfo binary is distributed as part of the dvutils package, which is hosted on GitHub. The repository contains the source files, a Makefile, and unit tests. Developers can clone the repository and build dvinfo with:

git clone https://github.com/multimedia/dvutils.git
cd dvutils
make
sudo make install

Testing

The test suite includes a collection of synthetic DV files covering various combinations of frame rate, field polarity, and chroma subsampling. Running make check executes these tests, verifying that dvinfo reports the expected values. The test harness also exercises the debug and all‑frame options.

License

dvinfo is released under the GNU Lesser General Public License version 2.1. The LGPL permits integration into proprietary software under the LGPL’s linking exception, making it suitable for commercial use.

dv

The dv program is the primary non‑linear editing ingestion tool for DV footage. While dvinfo provides metadata extraction, dv performs format conversion, deinterlacing, and audio extraction.

ffprobe

FFmpeg’s ffprobe can also parse DV files and report frame rate, resolution, and interlacing. However, ffprobe requires the full FFmpeg library, which may be unnecessary for users who only need DV metadata. dvinfo offers a lighter, more focused alternative.

ffmpeg

FFmpeg can be used to convert DV files to other formats, such as MXF or MP4. Before performing conversion, users often invoke dvinfo to verify that the input matches the expected properties.

Timecode Tools

Tools such as tcprobe and tctools specialize in timecode extraction from video files. dvinfo can be used in conjunction with these tools for a comprehensive timecode audit.

Future Directions

Extended Statistics

Future releases may incorporate more advanced statistics, such as per‑color histogram, field polarity correlation, and audio sample analysis. These features would benefit researchers and developers working on compression or signal analysis.

GUI Integration

While dvinfo is a command‑line tool, a lightweight graphical front‑end could make the metadata more accessible to non‑technical users. A simple GTK or Qt application could wrap dvinfo and present the output in a table, providing quick filtering and search capabilities.

Cross‑Platform Support

The current implementation focuses on Unix‑like systems. Adding a Windows port - using Cygwin or MinGW - would broaden the user base, particularly among editors who use Windows‑based NLEs.

Conclusion

dvinfo is a compact, efficient, and versatile utility for extracting detailed metadata from Digital Video files. Its adherence to the SMPTE 259M standard and integration with the Video4Linux ecosystem make it a reliable companion in editing, archiving, forensic, and research workflows. By providing accurate information about interlacing, frame rate, color sampling, and timecode, dvinfo empowers users to make informed decisions about how best to ingest and process DV footage.

Search

Table of Contents