jpegextractor is a command line tool that extracts JPEG streams from arbitrary files or from standard input. I also offer a page on the JPEG file format, if you are interested in background information on that format.
Several file formats can include images as JPEG streams, e.g. PDF document files or ACDSee image database thumbnail files (image_db.dtf). To get at those JPEGs, you previously needed either a program that knows the file format and can extract the JPEGs from the right places, or a hex editor to copy the binary data "manually".
jpegextractor takes a different approach: it relies on the fact that valid binary JPEG streams start with the byte sequence ff d8 ff (values given in hexadecimal notation) and end with ff d9. It copies each such stream to a new file. Because jpegextractor simply looks for these two sequences, it does not have to know the format of the encapsulating file and thus works with all formats that embed JPEG streams.

Switches
Call the program with --help as its only parameter to get the following help screen:
Usage: java jpegextractor <OPTIONS> [FILEs]
Extract embedded JPEG streams from arbitrary files or standard input.

Options
  -H, --help                 Print this help screen and terminate.
  -d, --digits NUM           Pad numbers in output files to NUM digits.
  -D, --outputdirectory DIR  Write to directory DIR (default: ".").
  -p, --prefix P             Use P as output prefix (default: "output").
  -s, --suffix S             Use S as output suffix (default: ".jpg").
  -n, --initialnumber NUM    Use NUM as initial output number (default: 0).
  -o, --overwrite            Overwrite existing output files.
  -q, --quiet                Nothing is written to standard output.

Copyright (C) 2002 Marco Schmidt
This program is distributed under the GNU Lesser General Public License 2.1. See http://www.gnu.org/copyleft/lesser.html for more.
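
The marker search described above can be sketched in a few lines of Java. This is only an illustration under stated assumptions, not the actual jpegextractor source: the class JpegScanSketch and its methods extractStreams, findEnd, outputName and bytes are hypothetical names. The sketch assumes that a stream ends at the first ff d9 after its start sequence, that output names are built as prefix + number + suffix, and that numbers are left unpadded when no digit count is given. A JPEG that contains an embedded thumbnail has an inner ff d9 of its own, so a search this simple may cut such a stream short.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class JpegScanSketch {

    /** Collects every byte range that starts with ff d8 ff and ends with the next ff d9. */
    public static List<byte[]> extractStreams(byte[] data) {
        List<byte[]> streams = new ArrayList<>();
        int i = 0;
        while (i + 2 < data.length) {
            boolean isStart = (data[i] & 0xff) == 0xff
                    && (data[i + 1] & 0xff) == 0xd8
                    && (data[i + 2] & 0xff) == 0xff;
            if (!isStart) {
                i++;
                continue;
            }
            // The search for the end sequence begins after the three start bytes.
            int end = findEnd(data, i + 3);
            if (end < 0) {
                break; // start sequence without a matching end sequence
            }
            // Copy from the first start byte up to and including ff d9.
            streams.add(Arrays.copyOfRange(data, i, end + 2));
            i = end + 2;
        }
        return streams;
    }

    /** Returns the index of the first ff d9 at or after 'from', or -1 if there is none. */
    public static int findEnd(byte[] data, int from) {
        for (int j = from; j + 1 < data.length; j++) {
            if ((data[j] & 0xff) == 0xff && (data[j + 1] & 0xff) == 0xd9) {
                return j;
            }
        }
        return -1;
    }

    /** Builds an output name; the prefix + number + suffix scheme is an assumption. */
    public static String outputName(String prefix, int number, int digits, String suffix) {
        String num = digits > 0
                ? String.format("%0" + digits + "d", number)
                : Integer.toString(number);
        return prefix + num + suffix;
    }

    /** Convenience helper so that test data can be written as hexadecimal int values. */
    public static byte[] bytes(int... values) {
        byte[] result = new byte[values.length];
        for (int k = 0; k < values.length; k++) {
            result[k] = (byte) values[k];
        }
        return result;
    }

    public static void main(String[] args) {
        // Two tiny fake "JPEG streams" surrounded by unrelated bytes.
        byte[] sample = bytes(
                0x00, 0x11,
                0xff, 0xd8, 0xff, 0xe0, 0x01, 0x02, 0xff, 0xd9,
                0x22,
                0xff, 0xd8, 0xff, 0xdb, 0x05, 0xff, 0xd9,
                0x33);
        List<byte[]> streams = extractStreams(sample);
        for (int n = 0; n < streams.size(); n++) {
            System.out.println(outputName("output", n, 0, ".jpg")
                    + ": " + streams.get(n).length + " bytes");
        }
    }
}
```

Run on the sample data in main, the sketch reports two streams, output0.jpg with 8 bytes and output1.jpg with 7 bytes. It keeps everything in memory for brevity; a tool that reads large files or standard input would more likely scan the data as a stream.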
