Posted on February 7, 2010 - 05:51 by Robert Douglass.
Drupal's file handling capabilities keep getting better. Beyond the core upload module, the filefield module for CCK has enabled us to build sites with all sorts of files; documents, images, music, videos, and so forth. Searching within these docuements, however, has never been a common feature on Drupal sites. Some solutions have existed, particularly for extracting texts from PDFs and common wordprocessing documents. With Apache Solr, the attachments module, and an extension library called Tika, things can be much better. With Tika you can extract texts not only from Microsoft Office, Open Office, and PDF documents, you can also get text and metadata from images, songs, Flash movies and zipped archives. Searching for these texts is done as part of the normal Apache Solr driven site search.
Posted on December 7, 2009 - 13:15 by Kris Van Eyck.
We recently deployed a website to a production environment. Before that, the website lived on a staging server. On staging, the site got indexed perfectly by Acquia Search. When we made the move to production, we needed to clear out the index because of an issue we had (the index contained content from 2 drupal sites because they used the same acquia subscribtion key). On the production server, it now seems only new content gets indexed. The log is saying "Number of documents in index: 2".
Posted on September 30, 2009 - 12:31 by Robert Douglass.
For the last six months, Scott Reynolds has been keeping a big juicy secret. As the maintainer of the Apache Solr Views module, he knows just how cool the future of Drupal Search is going to be. His module, based on an idea and code from Thomas Seidl, lets you make custom searches against the Solr index the same way you currently make views against the MySQL database.
Posted on September 14, 2009 - 17:22 by Chris Charlton.
When utilizing the Acquia Search service, do we in any way need to alter the apachesolr/schema.xml file?
I've read posts where this file is altered for custom Solr setups and I'd like to know how Acquia subscribers should treat this file within their Drupal installations.
I'm trying to set up a WAMP sandbox to mimic my main Drupal install and I can't make the sandbox see my Solr installation. Solr works fine with Drupal on my main install.
Solr is nightly version from 4/15/2009
PHP 5.2x, Apache 2.0x, Drupal 6.11 are all the same as they are on my working box
Drupal module is 6.x-1.0-beta8
SolrPHPclient appears to date from 3/29/09
Tomcat is 5.5
I've copied the schema.xml and solrconfig.xml files from the Drupal moodule over to the Solr install
I can go to localhost/solr/admin/ and see that yes it's using these:
Solr Admin (drupal-0.9.2)
Posted on April 7, 2009 - 12:51 by Robert Douglass.
UPDATE: If you're working with the DRUPAL-6--2 branch please see the updated code example below.
In this article I will show you how you can write a tiny bit of code that will reveal new fields and facets for searching with the ApacheSolr module and Acquia Search. Using Acquia Drupal we'll write an example module that takes the file type from CCKfile and image fields and makes them into their own search fields. This results in us being able to filter our search results based on file type. This code fulfils the situation where you want, for example, to find a specific post that has a JPEG image, or all of the posts with PDFs that match a particular keyword.
Posted on December 19, 2008 - 12:47 by Robert Douglass.
Last week Acquia sent me to New Orleans to be a speaker at Lullabot's Do it With Drupal conference. The conference went very well and I gave a presentation about ApacheSolr, and how faceted search will change the way you think about finding things on your site. Since Acquia recently announced that we will be launching a hosted Solr search service even more people have shown interest in ApacheSolr.
Posted on November 18, 2008 - 04:32 by Jay Batson.
The search technology area is highly important to people with websites. As a result, I've spent serious time looking at it. Several things have come from this time spent:
The important thing: We'll soon be adding "hosted site search" capabilities to the Acquia Network for our subscribers. More about this below.