The Jones Collection

Recovering a 27-Year Wildlife Legacy Through AI-Assisted Research

The field reports, data, and personal writings of Robert D. Jones Jr. — "Sea Otter Jones"
Izembek & Aleutian Islands National Wildlife Refuges, 1948–1974
Kim Bridges & Claude (Anthropic)  |  May 2026
38Reports Digitized
~1,300Pages Processed
72Species Documented
210Places Mapped
27Years Covered
1Personal Narrative
I

The Person: Robert D. Jones Jr.

Robert D. Jones Jr. served as Refuge Manager at Izembek National Wildlife Refuge and the Aleutian Islands National Wildlife Refuge from 1948 to 1974. Colleagues called him "Sea Otter Jones" — a name earned through decades of pioneering fieldwork with Enhydra lutris during the critical years of the species' recovery from near-extinction.

Jones was not merely an administrator filing required paperwork. He dove with SCUBA gear to study otter feeding grounds, captured animals by hand in Aleutian surf, conducted aerial surveys from military B-17 aircraft, and argued passionately for the species' protection. Over 27 years, he produced 38 narrative reports totaling approximately 1,300 pages — one of the most detailed firsthand records of Alaskan wildlife and environmental change from the mid-twentieth century.

But the reports were typed on manual typewriters, often in difficult field conditions, and stored as physical documents. By the 21st century, they existed only as degraded photocopies and scans — rich in content but largely inaccessible. Recovering this material became the central challenge of this project.

Lazily watching a careening piece of driftwood slip soundlessly astern, my mind in fancy wanders back through the pages of recorded history into unrecorded time. Then a vast sea, the abode of strange and sometimes formidable creatures, flooded this very spot.
— Jones, A Canoe Trip Down the Missouri (opening lines)
The principal research problem that concerns this headquarters is the life history of the Sea Otter (Enhydra lutris). Studies were conducted during the period covered by this report and the information obtained follows.
— Jones, Refuge Narrative Report, 1949
II

Source Materials

The source materials present challenges typical of mid-century government field records. The 38 reports span 1948 to 1974 and were typed on manual typewriters, sometimes carbon-copied, and later photocopied and scanned at varying quality levels. OCR errors from conventional software are pervasive: "Izembek Bay" appears in 21 variant spellings across the corpus, from Iseabek to Issobok to Isenbok.

The reports combine narrative accounts with tabular data (species counts, weather observations, census figures), making them resistant to simple text extraction. Tables are formatted with typewriter spacing, columns don't always align, and page breaks interrupt data series. A standard OCR workflow recovers text but loses structure and meaning.

A separate personal narrative — Jones' account of a canoe trip down the Missouri River, typed on a similar machine — was recovered as a companion document. At 18 pages, it provides unique insight into Jones as a person: his literary sensibility, his naturalist's eye, and the character traits that would later sustain him through 27 years in the Aleutian wilderness.

III

Methodology: AI-Assisted Recovery

The recovery pipeline combines a vision-language OCR model (olmOCR-2-7B) with AI-assisted contextual interpretation. This two-stage approach is the key methodological contribution: raw OCR extracts text; AI understands it.

Scanned PDFs
38 reports
~1,300 pages
AI Vision OCR
olmOCR-2-7B
via DeepInfra
AI Context
Error correction
Structural parsing
Collection
Data, maps, narratives
Gazetteer, timeline

The contextual understanding stage is where AI transforms raw OCR into scholarship. Consider the gazetteer: 875 raw place-name extractions were deduplicated to 210 curated locations by recognizing that Isembek, Iseabek, Isombok, and 18 other variants all refer to Izembek Bay. This requires not just pattern matching but geographic knowledge — understanding that Andrew Lake, Andrews Lake, Andrew Lagoon, and Andrews Lagoon are all the same feature.

For species data, AI recognized table structures that conventional OCR flattened into unstructured text, reassembled broken data series across page boundaries, and distinguished between census counts, casual sightings, and historical references — distinctions that matter for scientific analysis.

The same contextual capability recovered Jones' voice in the narrative extractions. When assembling the Sea Otter thematic narrative, AI identified and preserved Jones' characteristic directness: his wry humor, his precise field observations, his willingness to report failures alongside successes.

Methodology Highlights

IV

The Collection: Components

The Jones Collection comprises the following components, each produced through the AI-assisted pipeline. Together they make Jones' 27-year body of work accessible for the first time in digital, searchable, analyzable form.

📄

Annotated Editions

All 38 refuge narrative reports, OCR-transcribed and formatted as clean, readable documents. Each preserves Jones' original text with annotations noting OCR-uncertain passages.

38 reports • DOCX + PDF • 1948–1974
🌎

Geographic Gazetteer

210 curated place names extracted from the reports, deduplicated from 875 raw mentions. Includes coordinates (30%), region assignments (98%), feature types, mention counts, year ranges, and OCR variant spellings.

210 locations • CSV + PDF • 10 columns
🐘

Sea Otter Data

Structured dataset of sea otter observations drawn from across the report series. Census counts, locations, behavioral observations, and population trends over 27 years.

52 curated records • CSV
🗺

Sea Otter Census Map

Interactive map plotting sea otter observation locations across the Aleutian chain, from Attu to the Alaska Peninsula, with population data overlays.

Interactive HTML • Leaflet
📚

Sea Otter Narrative

Thematic narrative assembled from Jones' own words across all 38 reports. Eight sections: the researcher, the animal observed, population recovery, human dimensions, nuclear testing, habitat, management philosophy, and legacy.

8 thematic sections • DOCX + PDF
📅

Interactive Timeline

Visual timeline of Jones' 27-year tenure with filterable categories: sea otters, wildlife, environmental events, management actions, and historical context. A table of contents to the entire report series.

Interactive HTML • 4 eras • 5 filters
📜

SEB Workshop Paper

Methodology paper prepared for the International Society for Ethnobotany AI Workshop, documenting the pipeline as a replicable approach for recovering historical field records in ethnobiology.

8-page PDF • Methods focus

The Canoe Story

Jones' personal narrative of an 8-day, 265-mile canoe trip down the Missouri River. Recovered from a separate typescript, it reveals the naturalist, writer, and adventurer behind the refuge reports.

18 pages • DOCX • Mobridge to Chamberlain
V

The Sea Otter: A Case Study in Recovery

The sea otter work serves as the proof of concept for the entire collection. Jones documented the species' recovery from near-extinction with a depth of observation that is, in retrospect, irreplaceable. His reports track the progression from the first confirmed sighting at Cold Bay in 1955 through the colonization of Izembek Bay in the 1960s to the complex management challenges of a recovering population sharing its habitat with nuclear testing at Amchitka.

From the data table, map, and thematic narrative, a researcher today can reconstruct the sea otter's return to the eastern Aleutians in a way that was impossible from the original undigitized reports. The 52 curated data records, plotted on the interactive map, reveal spatial and temporal patterns in the recovery. Jones' own words provide the ecological context that raw numbers cannot.

The first Sea Otter recorded at Cold Bay was observed by myself … This animal was observed at very close range and there was absolutely no question of its identity.
— Jones, Refuge Narrative Report, 1955
My experience trying to inventory Sea Otter from the nose of a B17 at 140 to 150 KMH has not been very successful principally because of the speed.
— Jones, Refuge Narrative Report, 1949

The sea otter work demonstrates the methodology's capacity to extract structured data, geographic information, and narrative voice from the same source material. The approach is directly transferable to the other 71 species documented in Jones' reports, and to comparable historical document collections in other research domains.

VI

The Canoe Story: The Person Behind the Science

Separate from the refuge reports, the collection includes a remarkable personal narrative: Jones' account of an 8-day canoe trip down the Missouri River from Mobridge to Chamberlain, South Dakota — a distance of 265 miles. Written while Jones was employed as a District Supervisor for Grasshopper and Mormon Cricket Control with the Bureau of Entomology and Plant Quarantine, the story predates his Alaska years and reveals the person who would become "Sea Otter Jones."

The narrative is vivid and literary. Jones describes the Missouri's sand boils with a scientist's precision, details his provisions and preparation with a Scout's thoroughness, and observes wildlife — a prairie falcon stooping on a common tern, fawns playing on an island — with the same keen eye that would later track sea otters across the Aleutians. He and his companion "Bob" Sabs adopted a stray kitten named "Cheyenne" at the Cheyenne River, executed a 500-foot portage with help from Sioux Indians at Lower Brule, and paddled through glorious sunsets and mosquito-plagued nights.

The canoe story was recovered from an 18-page scanned typescript using the same AI-assisted OCR pipeline developed for the refuge reports. Its successful transcription — from a different document type, without tables or data, in a purely narrative voice — validates the pipeline's generalizability beyond government wildlife records.

Here, drifting down the "Big Muddy," one comes face to face with the signs of those wild and primitive forces which prepared the world for a habitation of man.
— Jones, A Canoe Trip Down the Missouri
The only thing I might have requested to complete the feeling of transcendency would have been a full symphony orchestra rendering a masterwork.
— Jones, on the Missouri River scenery
Real friendship does not, however, take time nor distance into account, and as we separately turned away from the river I knew we would forever meet there on its inexorable course to the sea.
— Jones, closing lines
VII

For Future Researchers

This collection is deliberately unfinished. It demonstrates what AI-assisted recovery can accomplish; it does not exhaust what Jones' reports contain. The sea otter case study is a proof of concept. Seventy-one other species await similar treatment. The gazetteer provides a geographic framework, but the reports contain environmental observations, weather records, and management decisions that could yield structured datasets of considerable value for contemporary research.

The canoe story arrived with some lines clipped during scanning. A researcher with access to the original typescript can restore them. The annotated editions flag OCR-uncertain passages that could be verified against source documents. The methodology paper documents the pipeline in sufficient detail for replication.

Opportunities for Further Research

Jones spent 27 years in the field, observing and recording with a dedication that was extraordinary even by the standards of his era. This collection is an attempt to honor that work by making it accessible again — not as an archive to be preserved, but as a foundation to be built upon.

Before our figures can become a sound basis for management the method of census taking must be perfected.
— Jones, 1949 — a standard he held himself to for the next 25 years