Vlad Fedorkov

Performance consulting for MySQL and Sphinx

PHP Crawler


PHP-Crawler is a very simple crawl/search script with full-text support for small websites. It is based on PHP and MySQL and needs no shell access: crawling can be run from a browser. Created back in 2006, it remains one of the most popular PHP crawler scripts in the world.


Features

  • Full text indexing
  • Crawling is limited by depth setting
  • Safe spidering: the maximum page size can be limited
  • Follows “href=” links found on a web page, in HTML or JavaScript
  • MySQL based
  • Simple installation
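
As a rough illustration of how the depth limit, the page-size limit and “href=” link following fit together, here is a minimal sketch. It is not PHP-Crawler's actual code: the function names, the constants and the in-memory `$pages` array are all hypothetical, it assumes PHP 7 or newer, and it assumes oversized pages are skipped entirely.

```php
<?php
// Hypothetical sketch, NOT PHP-Crawler's real implementation.
// All names below (MAX_DEPTH, MAX_PAGE_BYTES, extractLinks, crawl) are illustrative.

const MAX_DEPTH = 2;          // assumed depth setting
const MAX_PAGE_BYTES = 1024;  // assumed maximum page size in bytes

// Collect href="..." targets. A plain regex also matches links that
// appear inside inline JavaScript strings, not only in HTML tags.
function extractLinks(string $html): array
{
    preg_match_all('/href\s*=\s*["\']([^"\']+)["\']/i', $html, $matches);
    return array_values(array_unique($matches[1]));
}

// Breadth-first crawl. $fetch is any callable that returns a page body
// as a string, or null when the page cannot be loaded.
function crawl(string $entryUrl, callable $fetch, int $maxDepth = MAX_DEPTH, int $maxBytes = MAX_PAGE_BYTES): array
{
    $seen = [$entryUrl => 0];  // url => depth at which it was discovered
    $queue = [$entryUrl];
    $indexed = [];             // url => depth, for pages that were accepted

    while ($queue) {
        $url = array_shift($queue);
        $depth = $seen[$url];

        $body = $fetch($url);
        if ($body === null || strlen($body) > $maxBytes) {
            continue;          // assumption: missing or oversized pages are skipped
        }
        $indexed[$url] = $depth;

        if ($depth >= $maxDepth) {
            continue;          // depth limit reached: do not follow links further
        }
        foreach (extractLinks($body) as $link) {
            if (!isset($seen[$link])) {
                $seen[$link] = $depth + 1;
                $queue[] = $link;
            }
        }
    }
    return $indexed;
}

// Usage with an in-memory "site" so the sketch runs without network access.
$pages = [
    '/'  => '<a href="/a">A</a><script>var s = "<a href=\'/b\'>B</a>";</script>',
    '/a' => '<a href="/c">C</a>',
    '/b' => str_repeat('x', 5000),  // larger than MAX_PAGE_BYTES, so it is skipped
    '/c' => '<a href="/d">D</a>',   // at depth 2, so its links are not followed
    '/d' => 'never reached',
];
$fetch = function (string $url) use ($pages) {
    return $pages[$url] ?? null;
};

$result = crawl('/', $fetch);
print_r($result);
```

With the sample data above, `/`, `/a` and `/c` are accepted at depths 0, 1 and 2. `/b` is dropped for its size, and `/d` is never visited because it lies beyond the depth limit. A real crawler would replace `$fetch` with an HTTP request and resolve relative URLs before queueing them.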


Requirements

  • PHP 4.3.10+
  • MySQL 3.23.56+

The latest version is available on SourceForge under the terms of the BSD License.

Download php-crawler now.

  • sunel says:

    Hi, your PHP crawler was very useful for our small project, but I need help: it works only within my localhost, and I need to make it work across the entire web. I look forward to your help. Thank you in advance.

    December 15, 2011 at 6:35 pm
    • vlad says:

      You may want to set $CRAWL_ENTRY_POINT_URL in the config file so that it points outside your localhost (for 0.7.7-alpha), but please note that PHP-Crawler is not designed to crawl the entire web :)

      December 16, 2011 at 10:00 am
  • Vikram says:

    Your crawler is superb, man. I want to know the algorithm you have used to crawl, the algorithm used to search, and how to use this to crawl multiple sites at a time :) Thanks in advance.

    February 9, 2012 at 11:35 am
  • Buttonator says:


    Some dirs/files are missing from the package (tpl/elt/head.php; tpl/top/table.php; tpl/bot/html.php). As I saw in the config, they must be created at the right paths, but what should their content be?

    February 25, 2012 at 11:55 pm
  • Johnny Wunder says:

    I really like phpCrawler. It gives me exactly what I want in terms of a lightweight crawler I can point at whatever web site I want to analyze, but I seem to be misinterpreting the use of the $CRAWL_PAGE_EXPIRE_DAYS parameter. On line 39, within function markOldURLsToCrawl of my version of _crawler.php, it checks whether the crawl time has expired and the page needs to be recrawled, but then, regardless of the result, it deletes words on line 40. This causes follow-on searches to stop working until the site is recrawled. That doesn’t seem right to me. Do I have a good version, and am I interpreting it right?

    April 1, 2012 at 6:02 pm
  • Edward says:

    A small enhancement to the crawler.sql script on SourceForge:

    create table phpcrawler_links () ENGINE = MYISAM;

    Otherwise, the fulltext index will fail.

    April 16, 2012 at 7:48 am
    • vlad says:

      Edward, good catch, thank you!

      October 16, 2012 at 11:33 am



  • mekix says:

    I really like your crawler! Thanks for creating it. Greetings from Peru.

    November 8, 2012 at 9:11 pm
  • NikoS says:

    Hello, my question is: can php-crawler index PDF or DOC files?

    November 21, 2012 at 5:59 am
  • Roylee says:

    How do I index a website? The path you gave to start the crawl redirects to search.php / the home page. A quick reply is appreciated, thanks!

    April 27, 2013 at 8:23 am
  • uche umeevuruo says:

    Please, the crawler does not crawl my site. Please how do I rectify this issue?

    September 21, 2013 at 7:58 pm
  • Marvin Hand says:

    1. I love it and Thanks
    2. You should go over these codes again.

    November 10, 2013 at 10:21 pm
  • Dharav Samani says:

    Where is the content of crawled web pages stored?
    Can the crawler extract only the user comments from a webpage?
    Which other parameters can we change, such as CRAWL_DEPTH, $CRAWL_PAGE_EXPIRE_DAYS, etc.?

    January 13, 2014 at 8:32 am
  • Bill says:

    Just found the crawler, have you ever thought about expanding the code to support multiple crawlers at the same time?

    September 17, 2014 at 10:53 pm
    • vlad says:

      I would love to rewrite the crawler to support multithreading and advanced full-text search. The main constraint for me is time, so any contributions are appreciated.

      September 18, 2014 at 5:14 am
  • Salim Kureshi says:

    Thanks for sharing, it works really well.

    But I need a crawler for other websites such as http://snapdeal.com

    I replaced $CRAWL_ENTRY_POINT_URL = “http://snapdeal.com” but there are no results, so please help me with how to do it.


    October 31, 2014 at 9:43 am
  • Rashad Beaureguard says:

    This is awesome. I love finding individuals whose interests coincide with my own. I'd love to pick your brain and connect. In your experience, what is the best language for building web crawlers? Here's a good resource for building with Python.

    August 8, 2017 at 9:07 pm
  • woobs says:

    Nice crawler. We use it on our website and it works very well.

    December 8, 2017 at 12:33 am
  • بازار محصولات پلاستیکی says:

    You have a very good site and we have made use of it. We wish you all the best and hope you are always successful in your work.

    September 28, 2018 at 12:49 pm
  • خرید نشاسته says:

    I enjoyed visiting your site. I hope you are successful in every stage of life.

    September 29, 2018 at 6:34 am
  • Aminul Islam says:

    Very Nice Bro

    December 4, 2018 at 6:19 pm
