Crawling the Dark Web to Detect the Next Market

Crawling the Dark Web to Detect the Next Market

Over the past few days, the internet has been abuzz with talk of the recent blows dealt by law enforcement to two major dark web “marketplaces”, AlphaBay and Hansa market; and the […]

Can Data Science Deliver a Fake News Detector?

Can Data Science Deliver a Fake News Detector?

Regardless of your political opinion, fake news has dominated the conversation since the 2016 US presidential election. The crux of the problem is that the very definition of what qualifies as fake […]

The Hackathon Award for Best API Mashup Goes to…

The Hackathon Award for Best API Mashup Goes to…

Competitive programming competitions, commonly referred to as Hackathons, offer a great opportunity for new talent to show what they can do. Much like professional sports, industry leaders send recruiters to scout out […]

Webz.io API Featured in New Guide to Web Development with Django

Webz.io API Featured in New Guide to Web Development with Django

Last February, co-authors Leiff Azopardi and James Maxwell completed the latest edition of their book Tango with Django. It presents an excellent step-by-step approach to learning Python on the popular Django framework […]

How to use rated reviews for sentiment classification

How to use rated reviews for sentiment classification

Sentiment classification is a fascinating use case for machine learning. Regardless of complexity – you need two core components to deliver meaningful results; a machine learning engine and a significant volume of […]

Can Crawled Web Data Tell the Future?

Can Crawled Web Data Tell the Future?

Robert Tercek’s book Vaporized: Solid Strategies for Success in a Dematerialized World recently recently won GetAbastract’s 2016 International Book of the Year award at the Frankfurt Book Fair. Based in Hollywood, Robert has […]

Should you buy crawled web data or build your own solution?

Should you buy crawled web data or build your own solution?

In a technologically driven environment, the temptation to develop a proprietary web crawling solution is virtually irresistible. Our latest report examines the true cost of computing and software development resources required to deliver a data […]

Guide to Structured Web Data Consumption: How to get instant access to news, blogs, and online discussions

Guide to Structured Web Data Consumption: How to get instant access to news, blogs, and online discussions

The full guide to structured web data consumption

5 Ways to Measure the Impact of Crawled Web Data on Your Business

5 Ways to Measure the Impact of Crawled Web Data on Your Business

The analysis you provide is only as good as the raw data you start with. Although data from the open web is often perceived as a commodity, not all crawled data is […]

Why Extracting Content From The Open Web Is Better than Surveys for Research

Why Extracting Content From The Open Web Is Better than Surveys for Research

What’s the best way to find out how people feel about a given topic? Simply ask them, right? Well, at least that’s what we’ve been led to believe. Standard polling practice tells […]

100% Coverage of the Web

100% Coverage of the Web

Achieving 100% coverage is the Holy Grail. To be able to tap into the World Wide Web is something that anyone dealing with data would like to have, but is far FAR […]

How Crawled Data Gave One News Outlet the Edge in the Israeli Election

How Crawled Data Gave One News Outlet the Edge in the Israeli Election

In the spring of 2015, as Israel prepared for general elections, virtually all of the mainstream media analysts believed that change was in the air. Conventional wisdom at that time had it […]