SitePoint: Run Your Own Spider | Linux Today

Written By
Web Webster
Sep 30, 2004

“I came across Carlos Perez’s blog, manageability.org, while
Googling for some research today. Carlos had a great list of open
source web crawlers that included JSpider, a tool I have used for
error checking on web sites.

“JSpider is written entirely in Java and can be configured
extensively for spidering, error checking and downloading. It of
course obeys robots.txt files and additional options included in
configuration…”

Complete Story

