Skip to content

future: improve storm crawler output #30

@wumpus

Description

@wumpus

In order to be able to use Storm Crawler when it has features we want, we need to bring its output up to parity with nutch

  • request records
  • write robotstxt and crawldiagnostics warcs
  • metadata records?
  • other?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions