Ran Geva

News API

Auritus: Open-Source, Public Relations Monitoring Platform

The reason I started Webz.io is because I experienced the difficulties in collecting web data at scale when I worked on a previous project named PRTrack.it. At PRTrack.it we wanted to create...

Web Data

The Danger of Fake Reviews

How to Spot Fake Reviews in Time for the Holidays Black Friday is here, and as the biggest shopping day of the year, it means a lot of people will be on...

Machine Learning

Financial success using AI and Time Travel

Wait let me explain. I can explain every part of this click-bait title, it will make sense I promise. So, A great philosopher named Homer Simpsons once said: “Trying is the first...

Web Data

Calling all (almost) Kimono Labs Developers to Migrate to Webz.io

Kimono Labs made an announcement today that it has been acquired by Palantir. Unfortunately Kimono Labs users will only have two weeks to migrate to a different service because the team will...

Web Data

Article’s publication date extractor – an overview

A few days ago I’ve released an open source Python module that provides you with a simple way to extract and normalize the publication date of any online blog or news post....

Machine Learning

Ever imagined how "Big Data" looks like?

We have created a fun little experiment, letting you navigate in a 3D universe of real data from the open web. The data is made out of important news and blog titles, their meta-data...

Company

30-Days of Historical Data Access for Webz.io Now Available

I’m very happy to let you know about the launch of our extended access to 30-days of historical data from Webz.io, which is available to our paying customers immediately. No waiting list....

Web Data

To crawl or not to crawl, that is the question

In order to write an efficient crawler, you must be smart about the content you download. When your crawler downloads an HTML page it uses bandwidth, memory and CPU, not only its...

blog dead simple for devs python crawler script for extracting structured data from any almost website into csv

Web Data

Dead simple {for devs} python crawler (script) for extracting structured data from any website into CSV

On my previous post I wrote about a very basic web crawler I wrote, that can randomly scour the web and mirror/download websites. Today I want to share with you a very simple...

Ran Geva

Sort by

Ready to Explore Web Data at Scale?