Web Data

Crawling the TOR network – Challenge Accepted!

The following short story portrays the surprising technological and logical challenges we faced while developing our dark web monitoring technology....

Webz.io Image Recognition Helps Identify Illicit Content

How Webz.io Uses Image Analysis and Recognition to Identify Illicit Content on the Dark Web Collecting data from the Dark...

How Does a Web Crawler Work?

  With the advent of the digital age and its unprecedented source of data, both individuals and organizations alike wanted...
main-image-1

The Danger of Fake Reviews

How to Spot Fake Reviews in Time for the Holidays Black Friday is here, and as the biggest shopping day...

What is the Omgili Bot, and why is it Crawling Your Website?

Hi there. If you’re reading this, it’s probably because you’ve run into Omgilibot – perhaps in your web analytics or...

How to Extract Data from Websites: Scraping Tools, DIY or DaaS

This is part 2 of our guide to web data extraction. Read part 1 to learn about the questions to...

Web Data Extraction Guide: 11 Questions to Ask

The following is an excerpt from our new Web Data Extraction Playbook. We’ll be publishing the second part next week,...

Article’s publication date extractor – an overview

A few days ago I’ve released an open source Python module that provides you with a simple way to extract...

Dead simple {for devs} python crawler (script) for extracting structured data from any website into CSV

On my previous post I wrote about a very basic web crawler I wrote, that can randomly scour the web and...
Footer Background Large
Footer Background Small

Power Your Insights with Data You Can Trust

icon

Ready to Explore Web Data at Scale?

Speak with a data expert to learn more about Webz.io’s solutions
Speak with a data expert to learn more about Webz.io’s solutions
Create your API account and get instant access to millions of web sources
Create your API account and get instant access to millions of web sources