About Consolidated Logbook Data
Database description
American whalemen sailed out of ports on the east coast of the United States and in California from the 18th to early 20th centuries, searching for whales throughout the world’s oceans. From an initial focus on sperm whales (Physeter macrocephalus) and right whales (Eubalaena spp.), the array of targeted whales expanded to include bowhead whales (Balaena mysticetus), humpback whales (Megaptera novaeangliae), and gray whales (Eschrichtius robustus). Extensive records of American whaling in the form of daily entries in whaling voyage logbooks contain a great deal of information about where and when the whalemen found whales.
The American Offshore Whaling Log database includes information from about 1,500 logbooks from American, and a few British, offshore whaling voyages between 1784 and 1920. These data were extracted from the original whaling logbooks during three separate scientific research projects, one conducted by Lt. Cmdr Matthew Fontaine Maury in the 1850s, the second conducted by Charles Haskins Townsend in the 1930s, and the third conducted by a team from the Census of Marine Life project (CoML, www.coml.org) lead by Tim Denis Smith between 2000 and 2010. The Maury and Townsend data were assembled by the CoML team from archival sources, and there were limitations on the quality and completeness of the data available. In addition, entries were added for strikes on right whales by Dale Chatwin for a study of Indian Ocean Southern Right Whales (IOSRW), and revised entries for some Maury entries were made directly from the relevant voyage logbooks.
The data file includes 468,249 data records assembled in a common format suitable for spatial and temporal analysis of American whaling throughout the 19th century. The data fields are described in the Column definitions.
The projects selected whaling logbooks in different manners and for each logbook the details of the data extracted varies. Maury extracted data from most days the vessel was a sea, but often omitted the initial portions of many voyages in the North Atlantic and many days when the whalers were whaling within enclosed bodies of water (e.g. the Okhotsk Sea). Townsend extracted data only from the days when whales were struck or killed, leaving potentially large gaps in the sequence of dates and positions for any single voyage. CoML data were collected from all days recorded in the logbooks.
Spatial and temporal gaps occurred in all the data by design and because some logbooks did not include entries for some days. In some cases the gappy data were interpolated, especially where the gaps were small but also at times for larger gaps. Unfortunately, the latter at times created interpolated data points on land, especially when vessels crossed the date line or when logbooks ended well before the vessel returned to its home port. Not all of these problems have been addressed in these data.
In the age of traditional whaling the process of determining a vessel’s location was an imprecise affair, and on whaling vessels only one location reading was usually taken each day–at noon. Latitude and longitude readings can be subject to error in the region of 2 to 3 degrees in any direction for both latitude and longitude (i.e., out by hundreds of miles) and even when multiple records of a voyage exist (as they often do) conflicts arise due to the skill (or not) of the officer taking the reading. Location readings are therefore, at best, an estimate of where a vessel was, where whales were sighted and struck, and where contemporary and modern indexers believe the vessel to have been located. Only when sightings are made of known places is it possible to be certain of a location.
One must also be aware that at sea whaling vessels used Ship Time. The Day on a whaling vessel ran from noon on the previous day until noon on the day actually recorded in the log. This aided the vessel in recording events of the voyage and helped manage the long period of the night as one event, when the vessel usually ‘hove to’.
Also, both the logbook keepers and the data extractors sometimes made mistakes, for example transposition of digits and, rarely, miscoding of hemispheres. These have been corrected when plotting of individual voyage tracklines have revealed such problems during some data analyses. However, not all of these have been corrected.
The data extractors attempted to identify the common name of cetaceans recorded in the logbooks, but this was not always possible because encounters were sometimes described only as unknown or as “whale.” Thus some of the cetaceans encountered are listed here only as “Whale.” (See also the caution on About Maury’s Data.) The CoML data included encounters with a wider range of cetaceans than the other two sources did. Maury’s data were only from logbooks completed before the early 1850s so do not include some species that had not then been targeted, for example bowhead whales. A more complete description of these data is given in the Materials and Methods section (see page 11) of a research paper authored by some of the CoML team; the paper itself is freely available online as Smith TD, Reeves RR, Josephson EA, Lund JN (2012) Spatial and Seasonal Distribution of American Whaling and Whales in the Age of Sail. PLoS ONE 7(4): e34905. doi:10.1371/journal.pone.0034905.