Identifying file formats with fido
fido (Format Identification for Digital Objects) is a command-line tool used to identify the formats of digital objects. It is developed by the Open Preservation Foundation and compares file signatures against the PRONOM registry to determine a file's format.
Installation
To install fido using pip, run the following:
pip install opf-fido
If the installation is successful, you should be able to run the following to see fido's help message:
fido -h
Identify an individual file's format
The simplest way to identify a single file's format using fido is by piping a filepath in as stdin
using the ls
command:
ls /mnt/dps/david/sample_file.abc | fido -input -
The above command converts the full filepath of sample_file.abc
to stdin
, which is then fed into the fido process itself. "-input -
" tells fido to treat the stdin
as a file to be analyzed.
In this example, fido is able to correctly identify sample_file.abc
as a Microsoft Office Excel file, even though the .abc
file extension is not one that is used by Excel. You can test this yourself by changing the file extension of any working file to a different extension and running it through fido. The fido output should show the original and correct file format information, whatever the extension.
Welcome to the University Wiki Service! Please use your IID (yourEID@eid.utexas.edu) when prompted for your email address during login or click here to enter your EID. If you are experiencing any issues loading content on pages, please try these steps to clear your browser cache.