Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Trace Crawler SOFTWARE

Software ·
DOI:https://doi.org/10.11578/dc.20220615.2· OSTI ID:code-74911 · Code ID:74911

The trace crawler is a tool for selective web crawling to archive web resources with well-defined boundaries. The specific web navigation steps (or trace) are formulated for the families of webpages, where layout or HTML structure can be similar but the content is different, for example, GitHub, Slideshare, blogs, etc. The trace is recorded in a json file format.

Site Accession Number:
C22054
Software Type:
Scientific
License(s):
BSD 3-clause "New" or "Revised" License
Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE

Primary Award/Contract Number:
AC52-06NA25396
DOE Contract Number:
AC52-06NA25396
Code ID:
74911
OSTI ID:
code-74911
Country of Origin:
United States

Similar Records

Memento Tracer Extension
Software · Tue Jun 14 20:00:00 EDT 2022 · OSTI ID:code-74900

ScholarGuard
Software · Tue Oct 08 20:00:00 EDT 2024 · OSTI ID:code-148789

Nux, V.1.0
Software · Sat May 28 20:00:00 EDT 2005 · OSTI ID:code-314

Related Subjects