How web data saves lives, identifies threats, and brings criminals to justice

The Webz.io team will remember February 13th, 2017 primarily because of the opportunity to present at i-HLS Big Data alongside Fortune 500 leaders including IBM, SAP, and Dell. Over 500 participants convened…

How to use rated reviews for sentiment classification

Sentiment classification is a fascinating use case for machine learning. Regardless of complexity, you need two core components to deliver meaningful results: a machine learning engine and a significant volume of…

How to access, cite, and defend web datasets in academic research

We’re used to getting questions about accessing structured web data. But recently, we’ve been fielding a different kind of question. Researchers and scientists have been asking about data citation conventions and how…

Can Crawled Web Data Tell the Future?

Robert Tercek’s book Vaporized: Solid Strategies for Success in a Dematerialized World recently won getAbstract’s 2016 International Book of the Year award at the Frankfurt Book Fair. Based in Hollywood, Robert has…

Web Data Visualization of The Hillary Clinton Top 100 Network Graph

The web data business can get pretty tricky, especially when your job is to extract the broadest possible dataset from the planet’s biggest database. Last week, Webz.io CEO Ran Geva ran a…

Should you buy crawled web data or build your own solution?

In a technology-driven environment, the temptation to develop a proprietary web crawling solution is virtually irresistible. Our latest report examines the true cost of computing and software development resources required to deliver a data…

Top 10 Big Data Stories Leading the Conversation

In the right hands, crawled web data can tell an amazing story. We were interested in the top 10 news stories, sorted by social shares on Facebook and LinkedIn. So we set up a…

The Race to Achieve 100% Coverage of the Web

In our new report, we deconstruct the all-too-familiar race to achieve 100% coverage of the web. Data acquisition efforts usually rely on one of three approaches – build an internal web crawling…

Guide to Structured Web Data Consumption: How to get instant access to news, blogs, and online discussions

The full guide to structured web data consumption

5 Ways to Measure the Impact of Crawled Web Data on Your Business

The analysis you provide is only as good as the raw data you start with. Although data from the open web is often perceived as a commodity, not all crawled data is…