Table of Contents

Benchmarking

Files

All statistics for the Du Bois Digitization Project are kept in a spreadsheet called DuBoisProgress.xls. It can be found here:

r:/dubois_projects/dubois_digitization/project_management.

Utilities

countfiles.py

Usage:

To view options: countfiles -h

Standard usage:

countfiles -n -p tif -r 20110206-20110212 staging

This command counts all the files with the pattern (-p) “tif” that were created between the given date range (-r). The -n option will print out counts for each user name. The final argument should be the starting directory, typically the staging directory on the y: drive. I usually run this script from the staging directory and use a '.' for the final argument. To count xml files, substitute xml for tif.