The Danger of Fake Reviews

The Danger of Fake Reviews

How to Spot Fake Reviews in Time for the Holidays Black Friday is here, and as the biggest shopping day of the year, it means a lot of people will be on […]

3 Predictions for Web Data in 2018

3 Predictions for Web Data in 2018

2017 was a turbulent year: With Donald Trump shaking up the American political system, cryptocurrencies causing riptides throughout financial markets, and advancements in artificial intelligence sparking both anticipation and anxiety in the […]

Was Black Friday Worth It? The Crawled eCommerce Data Answer

Was Black Friday Worth It? The Crawled eCommerce Data Answer

Now that we’ve had a chance to recover from the Black Friday and Cyber Monday crazes, which chalked up record sales this year, it’s time to ask the inevitable question: was Black […]

5 Great Reasons to Meet Us at Strata

5 Great Reasons to Meet Us at Strata

If you’re visiting this year’s Strata Data Conference in New York, you can find us at Booth #P17, and absolutely should. Here are 5 reasons why our (modest) booth is probably going […]

Machine Learning Showdown: Python vs R

Machine Learning Showdown: Python vs R

Let’s say you have an amazing idea for a machine learning app. It’s going to be brilliant. It’s going to revolutionize the world of finance, mobile advertising, or… some other world, but […]

How to Use Online Review Ratings to Crush the Market

How to Use Online Review Ratings to Crush the Market

Sifting through millions of posts on review sites presents both a massive undertaking and an incredible opportunity for influencer marketing. Some of the most successful app makers are capitalizing on that oppotunity. Use […]

The Race to Achieve 100% Coverage of the Web

The Race to Achieve 100% Coverage of the Web

In our new report, we deconstruct the all-too-familiar race to achieve 100% coverage of the web. Data acquisition efforts usually rely on one of three approaches – build an internal web crawling […]

Guide to Structured Web Data Consumption: How to get instant access to news, blogs, and online discussions

Guide to Structured Web Data Consumption: How to get instant access to news, blogs, and online discussions

The full guide to structured web data consumption

5 Ways to Measure the Impact of Crawled Web Data on Your Business

5 Ways to Measure the Impact of Crawled Web Data on Your Business

The analysis you provide is only as good as the raw data you start with. Although data from the open web is often perceived as a commodity, not all crawled data is […]

Webz.io Archive Access is now LIVE

Webz.io Archive Access is now LIVE

Following popular demand, we are really happy and excited to grant access to Webz.io’s historical data archive. This is the first time that anyone can programmatically access a huge index of the internet […]

The Top 10 Data & Analytics Articles of 2015

The Top 10 Data & Analytics Articles of 2015

The online world of data and analytics is fast approaching epic portions. It’s easy to get overwhelmed. Why? Because, not only has big data been big business in 2015 … but posts, […]

Tiny basic multi-threaded web crawler in Python

Tiny basic multi-threaded web crawler in Python

If you need a simple web crawler that will scour the web for a while to download random site’s content – this code is for you. Usage:

Where https://cnn.com is your seed site. It could […]