Arch Search Engine v.1.9.2


Arch is an open source extension of Apache Nutch (a popular, highly scalable general purpose search engine) for intranet search. Not happy with your corporate search engine? That is not surprising; very few people are. To the best of our knowledge, no intranet search engine works as well as Google's global Web search does. There is a fundamental reason for this: the algorithms that Google and similar engines use on the global Web do not work nearly as well on intranets, which lack the necessary statistical data. Arch solves this problem with a novel method that delivers high-precision search results. Don't believe it? Blind test evaluation tools are included, so you can deploy Arch and compare its performance to your current search engine and/or Google (on the public part of your site) using a blind test methodology.
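
As a rough illustration of what a blind test comparison involves, the sketch below hides which engine produced which result list behind the neutral labels A and B, then unblinds the judges' votes to count preferences per engine. It is not the evaluation tooling shipped with Arch; the function names `blind_pair` and `tally` and the engine names used here are hypothetical.

```python
import random
from collections import Counter


def blind_pair(results_by_engine, rng):
    """Hide two engines' result lists behind the neutral labels A and B.

    results_by_engine maps a (hypothetical) engine name to its result list.
    Returns what the judge sees and the secret label -> engine key.
    """
    engines = list(results_by_engine)
    rng.shuffle(engines)  # random assignment, so the label leaks nothing
    key = dict(zip(["A", "B"], engines))
    shown = {label: results_by_engine[engine] for label, engine in key.items()}
    return shown, key


def tally(votes, keys):
    """Unblind the votes: votes[i] is the label preferred for query i."""
    return Counter(key[vote] for vote, key in zip(votes, keys))


if __name__ == "__main__":
    rng = random.Random()
    results = {
        "arch": ["hr/leave-policy.html", "hr/forms.html"],
        "current": ["news/2014.html", "hr/leave-policy.html"],
    }
    shown, key = blind_pair(results, rng)
    print(shown)               # the judge sees only labels A and B
    print(tally(["A"], [key]))  # the analyst unblinds afterwards
```

The label-to-engine key is kept away from the judge until all votes are in, which is what makes the comparison blind.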

In addition to the excellent search quality, Arch has many features critical for corporate environments:

- Document level security. Users can find only documents that they are authorized to see.
- Inexpensive index updates. Arch is able to keep indexes up to date and avoid regular complete site recrawling.
- 24/7 availability. There is always a working index available, even if a crawl fails.
- Support for simultaneous indexing and search of multiple web sites, with the ability to search and administer any site separately if needed. Web sites can easily be added and removed dynamically.
- An automatically generated site directory.
- Low cost support once deployed.
- Dual interface (PHP and Java) for easy deployment and customization.
- Faceted search "out of the box".
- An extensive and extensible set of parsers for a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc.
- A modular, plugin-based architecture that can be easily customized and extended.
- The source code is included.
- High performance and scalability. Arch can run on computer clusters to index very large data sets.

An open source corporate search engine. Arch is an extension of Apache Nutch (a popular, extensible and highly scalable general purpose search engine) for intranet search. Unhappy with your corporate search engine? Try Arch. It can help. Don't believe it? Blind test evaluation tools are included.


 
  • Arch Search Engine
  • 1.9.2
  • 18 Aug 16
  • CSIRO Astronomy and Space Science
  • Win2000, WinXP, Win7 x32, Win7 x64, Windows 8, Windows 10, WinServer, WinOther, WinVista, WinVista x64
  • Freeware
  • 22.79 Mb
  • 769
  • Free
 
 
Latest Versions History
Version Date Released Release Notes
1.9.2 18.08.2016 Improved document parsing, ported to Nutch 1.9.
1.7 17.06.2014 Added security scanning, ported to Nutch 1.7.
 
 

 
 