Extracting Data from Websites

Data extraction from web pages takes one of four forms, depending on the type of data to be extracted:

  • Regular data: Unstructured data that has no repetitive patterns. An example is a paragraph of text.
  • Pattern-based data: Structured data that follows a repeating pattern, such as the rows of a table or the items of a list.
  • Table data: Data that resides in a table.