Open Source Search Engine Software!
SeekQuarry is the parent site for Yioop
. Yioop is GPLv3
, open source, PHP search engine portal software.
What can Yioop do?
Yioop software provides many of the same features of larger search portals:
- Search Results. Yioop comes with a crawler that can be used to crawl the open web or a selection of URLs of your choice. It can index popular archive formats like Wikipedia XML-dumps, arc, warc, Open Directory Project-RDF (now Curlie.org), as well as dumps of emails or databases. Once you have created Yioop indexes of your desired data sources, Yioop can serve as a search engine for your data. It supports "crawl mixes" of different data sources. Yioop also provides tools to classify and sculpt your data before being used in search results.
- Media Services. News is best when it is still fresh. Yioop has a media updater process that can be used to re-index RSS and Atom feeds on an hourly basis. This more timely information can then be incorporated into Yioop search results. Yioop's media updater also can be used to recode to mp4 uploaded videos, handle bulk email, and calculate view statistics.
- Social Groups, Blogs, and Wikis. Yioop can be configured to allow users to create discussion groups, blogs, and wikis. If Yioop is configured to allow multiple users, then users can share mixes of crawls they create. Blogs and discussion group can be made public or private and posts can be made to expire if desired. Yioop supports an easy to learn ChatBot api to make group feed chat bots. Public groups have public RSS feeds and the better amongst these can be chosen for incorporation in what Yioop's news service indexes. Each group also comes with its own wiki that can be used to make standard wiki pages as well as presentations and spreadsheet documents. Images and video can be uploaded to both feeds and wiki pages and Yioop can be configured to automatically convert video to web viewable formats.
- Web Sites. Yioop's wiki mechanism can be used to build websites. Besides generic, localizable wiki pages, it supports slide presentation pages, media list pages, and mechanisms to compose pages from subpages. It also has a Model View Adapter framework which can be easily extended to build customized search portal websites. Yioop can also be integrated into existing sites to provide search functionality either through an API, Open Search RSS, or JSON services. Yioop comes with stemmers, summarizers and other natural language processing tools that you can use as a package in your project via Composer.
- Monetization . Yioop sites can make use of external ad servers. Seekquarry also sells a payment processing script to support charging for internal keyword advertising and the ability to set a fee to join particular groups.
The software and hardware requirements for Yioop are relatively low. As Yioop implements its own web server, at a minimum, you only need a version of PHP 5.4 or later to run Yioop. A test set-up consisting of three 2011 Mac Mini's each with 8GB RAM, a single name server, and five fetchers can crawl about 100 million pages in a month.