Saturday February 11, 2012 @ 11:32:59 GMT+10    ( Weather:  n/a )
Home » Weblog Archives

Powered byD's Bloggie
Weblog » Archive » 2006 » September » Day 08 » Stucked with "homemade" search engine for own weblog
Browse by specific entry ...
 1920 x 1200 screen shot of my websiteWhere is the cute donkey ?? 

Stucked with "homemade" search engine for own weblog - 11:40 pm
help me ! help me !
Site Issue , SQL  -  poster 

It's weekend again, it's also time for me to make progress on updating my site. This time I've completed alot of things that's on my todo list, I would say this will be another major site update. However, it will be backend-wise, nothing much unless I do some cosmectic updates (which is also on my todo list @@)

Anyway, the biggest problem I encountered right now is building my own search engine that search through my weblog entries with ability to sort results by relevancy/date/topic/category. There are few easy ways to do this, one is using MySQL built-in "Full-Text Search" function, which also includes boolean full-text searches using IN BOOLEAN MODE modifier (eg using + - etc signs). Another way to do this is using a simple sql query with LIKE %string% clause.

Everything sounds so sweet and nice before I realize that my blog entries contains bbcode tags. The content of the weblog entries are rendered on-the-fly. So if you searched a keyword that's used for a bbcode tag, then the return result will not be accurate. MySQL also requires pre-indexing the data before you can use the full-text search, that means you'll need extra database storage for indexes and slower performance too.

So, I want to do a search on my rendered weblog entries instead of entries in raw format. That's possible, but that will probably be done with regex in gayness loops in PHP rather than MySQL (well, at least that's the way I can think of). If not then I'll have to stripped the bbcode tags and build my own indexing system. That's kinda redundant and I won't implement it with a complicated relevancy matching algo, so what's the point eh ?

I read some articles/guide/how-to but neither of them mentioned or have solution to the problem I mentioned. So I went to a few other people's weblog/forums to test this. The test results surprised me. Just try yourself in those "homemade" personal website/blogs and compare the ones that uses systems like phpbb/blogger.com. Use search query like "[b ]" or "<b>" and you'll know what I mean.

For now, I'll just see what I can come out with. If anybody has experience in this or knows great resources please let me know *scratching head... *

Some readings:
MySQL Full-Text Search Functions
Solano Systems - MySQL Relevancy Search
 1920 x 1200 screen shot of my websiteWhere is the cute donkey ?? 

Reader comments ( Add your comments )

- No comments (yet) -

Leave a public comment

Info Note: Neither BBCode nor HTML tags are supported currently.
Why ? Just to make my life easier and keep things simple.
Others: Comments that are longer than 1000 characters will be truncated.

    ) Info

Info

Info


Before you submit... verification

Please enter verification codes as seen in the image above to post your comment.
If you can't read the word, click here
(Your comment will be manually approved by admin before it's displayed to public)

$ view_blog.php 2009.09.17 18:16:41 $
Lost? | XML/HTML sitemap | Contact
38.107.179.241 , 22 queries , 0.7204s
Gzip enabled , CSS compressed , JS compressed
Copyright © 2005-2011 Darren's Outpost