Categories
Computing Java Web Development

Lucene and friends

As my current project draws to a close, I need to deliver document search capabilities.
SQL Server, despite how much I love it, requires you to have the documents reside *inside* the database for it to index them. That is ugly. So I ventured out and went with much-hyped and totally-cool Lucene from Apache. I am an Apache bitch, and why not, I love them and they love everyone.

So I ran some tests and everything works great and I can sleep at night.

Today, Saturday, after we release the first iteration of the live site and I am in post-stress bliss, I discover that Lucene does not index PDFs out of the box. Text is cool, PDF ain’t cool.

Rapid searches turn into slow searches, during which I find bizarro projects such as Docco or Multivalent – both of which are as hostile and not helpful as they get. Do you REALLY expect to documentation? Blech! RTF(non-existent)M!
I even stumble across over-the-top solutions that provide you a complete web application that will index the ass off your website – Zilverline, and that’s very cool but I need a component or library I can integrate into my code and Lucene was so neat and fit that bill.

Finally, I come across PDF Box – which pretty much was it. It reads PDFs, converts them to text, and Lucene can now play with PDFs!

Now on to Word where Apache’s Jakarta POI is supposed to provide the goods for Excel, Word and even PowerPoint… am I being too optimistic?

Share
Categories
Java Web Development

MyEclipse Headache – Removing WebProject Capabilities

After braving the upgrade to Eclipse 3.0 and installing MyEclipse we felt like we enjoyed using it and that it was benficial, especially its very very very cool ability to edit and text-complete JavaScript and HTML.
Still, we are unable to make it work properly with JSTL tags, which it could in previous releases.

So I went ahead and clicked on ‘Add WebProject Capabilities’ to the project from the MyEclipse context menu (right-clicking the project folder). This did not help. Worse, it forced me to have a WEB-INF folder no matter what. I would delete it and it would reappear. Grrrreat.

There is no way to remove WebProject Capabilities either. You gotta love one-way settings. After 20 aggravating minutes, the following surgery did the job:

1. Close Eclipse.
2. Edit the .project file and remove any elements that mention of Genuitec (the company builds MyEclipse).
3. Edit the .classpath file and remove WEB-INF from the classpath.
4. Restart Eclipse and right click the project folder
5. In the ‘Java Build Path’ screen, remove WEB-INF from the Output Folder settings and choose a different destination.
6. Delete the WEB-INF folder.

Phew!

Share
Categories
Web Development

Displaying DHTML over Flash

From my friend Rob Rolsma‘s findings:

To display a DHTML over a Flash object the Flash object must have its window mode set to transparent. This is achieved by setting the “wmode” parameter for the object tag AND the embed tag.

Within the object element for the Flash object include the following element:

<param name="wmode" value="transparent" />

Within the embed element for the Flash object, include the following attribute:

wmode="transparent"

Share
Share