There is no need for any installation to use the Koovan application. Koovan is a fully web-based cloud platform. The data you have obtained is stored in your own cloud account. You can access your data whenever you want and download a copy to your computer.
This document describes the use in Koovan. The steps in the documentation are sorted by scraping processes that need to be followed. By following these steps, you can start using the system very easily.
Koovan is a web-based web scraping application. It can collect data from public web pages, as well as facilitates data collection from web pages that have access restrictions. You can start using immediately by following the steps below.
The purpose of the document is to explain how you can collect data from a web page, how you can process and download the data you collect. If you cannot find what you are looking for in this document, please contact the support mail address immediately.
To crawl a website on Koovan, you must follow these steps.
You will find the details of these steps later in the document.
In this section, the descriptions of the screens in the application are written. Detailed explanation of each screen is illustrated below.
There are two sections on this screen. One of them is the section with the statistical data of the last ten days and the other is the graph showing the last ten days of data lines of the last one month websites.
The number of domain names, the number of binary files downloaded, the number of rows downloaded daily and monthly, the cloud account occupancy rate and the crawling data count graph of the last ten days.
It is the screen where detailed information of the registered account is displayed.
It is the screen where the website domains are located. In addition to the total number of domains registered, a list of the last ten domains added is on this screen.
It is the screen where logs formed as a result of the tasks and system processes initiated by the user.
To add a new website, click on the Add New Domain button from the Websites link and enter the name of the domain name you want to add. If the entered domain name is in accordance with the rules, the registration process is completed successfully.
It is possible to collect the data on public web pages with templates you can determine. You can create as many templates as you want for a website. This section explains how to create a template for a website.
A crawling task can be created for one or more web pages. There are three methods for this. Multiple links can be entered via a single link, or a text file can be uploaded to link. Creating crawling through a single link can be defined as follows.