Monthly Archive for November, 2006

Beagle and the Legend of the Archive Filter

It was many a long years ago when a brave man first suggested that beagle buckle down and offer comprehensive and complete indexing of archives. Not just plain text files, or certain subsets, beagle was to inspect every file, inside of an archive or out with the same degree of scrutiny.This was suggested over a year ago, and just this week, thanks to dBera’s hard work, it finally got into CVS. Its not done, its not polished, but its there, and its functional. The biggest accomplishment by far was the addition of graceful and complete child indexables support in the file system backend. While at first this seems like a very technical detail, and not of much interest, it actually enables beagle to do a lot of things that it couldn’t before. For example, if you happen to have some old mail files lying around with an embedded attachment, beagle will read and index those correctly, almost as if it were from a mail backend. (Theres still work to do if we want that as a use case, but its funny that it happened). This also means beagle can handle those pesky gzipped man/monodoc pages (not quite yet, but soon, I have to add single entry support still).

Despite the rampant awesomeness that is just radiating out of beagle at the moment, the Archive support still has a long way to go, I filed a bug about it, but wanted to go into more depth here.

1) We have an open connection to the archive for the entire time it takes us to
index it, this could be into hours for large archives, and if we were to crash,
we just killed someones backup....

2) GUI side of things, not a while lot to say here other than we should
probably try to do something intelligent with all the results as well (maybe
collapse them into one result that can be expanded, like we talked about doing
with the WebHistory), other than that, nothing to crucial, it would be nice if
we added a context menu option to extract the file in question, but thats
defiantly not on the short list.

3) File Locking, its rare and sporatic, as well as near impossible to
reproduce, but when you get a couple thousand temporary files going
asyncronously, it was bound to happen. Beagle recovers smoothly, but its
probably indicitive of a greater instability that goes hand in hand with that
many asyn actions.

4) Mid Index Shutdowns don't purge temporary files, it left me with a 300 mb
build up when I was testing.

Which leaves us with our work cut out for us, but I already started making the ArchiveFilter a little more sane. I have this patch, which takes care of most of the crazy temporary file madness and keeps the ChildIndexables cleaning up after themselves. While I was trying to figure out how to handle single entry Archives within our current setup, I realized that its probably just easier to create a FilterCompressed filter or the like which handles individually compressed files (like manpages and the like). I dunno, I’m looking at both options, but considering its pretty common to have beagle index someones /usr/share/doc folder, the manpage parser should probably air on the lighter and faster side, since the full ArchiveFilter is a bit of a memory hog (comes with the territory). Although another goal of mine is to prevent us from reading all the child indexables into one array, and just make it an enumerator style system.

In short, ther are a million things happening with beagle, I just chose to blog this one today, maybe I’ll mention some of the other cool stuff tomorrow.

The Brown Model of ADD Syndrome

I just recently stumbled across this article by a Dr. Thomas E. Brown on a new model for the definition of ADD, or Attention Deficit Disorder. Those of you that know me in person may be well aware that I have mild ADD, and many time have to fight with it to get even simple homework assignments done. Most of the literature that is mainstream on ADD seems to just write it off as an inability to focus, and unfortunately, most people accept that definition as more than enough. What struck me about Dr. Brown’s model was its breakdown of the different Executive Functions that ADD impairs. Below is a simple copy of that chart.

ADD

It was an incredible feeling to finally read something that seemed to describe what I actually felt, I have 2 choice quotes I wanted to share that basically embody what ADD is for me.

Patients with ADD describe chronic difficulty with excessive procrastination. Often they will put off getting started on a task, even a task they recognize as very important to them, until the very last minute. It is as though they cannot get themselves started until the point where they perceive the task as an acute emergency.

Which is an experience far too real for me, and :

Some describe their difficulty in sustaining focus as similar to trying to listen to the car radio when you drive too far away from the station and the signal begins fading in and out: you get some of it and lose some of it. They say they are distracted easily not only by things that are going on around them, but also by thoughts in their own minds.

Its the best description I have come across in a long time. I encourage anyone who wants to read more on Dr. Brown’s website.

Hello Beagle Planet!

Sidney The Happy BeagleAfter an itty-bitty little mix up over who actually ran the planet, I managed to get in touch with rlove, who quizzed me thoroughly on my morality, and then decided that it was indeed insubstantial enough to slip on by.

So in honor of be being bored and trying to put off a serious physics lab I present Sidney, a very cute beagle whom I know nothing about.

Official Member of the Gnome Foundation!

Yesterday evening I got my letter officially recognizing me as a member of the Gnome Foundation! I can’t begin to express my extreme gratitude and thanks to everyone at Gnome. But I would like to take a moment to thank Joe Shaw, Lukas Lipka, and Debajyoti Bera for basically putting up with me as I bumbled around the beagle code causing general havoc and mayhem.

So, go check out the official membership roll, and while your at it, note one of the newest additions (Thats me ;) )!

technorati tags:, ,

Case Wiki, Zimbra, and Bounties

Well, I’ve been cooped up sick all day today, and as I get more and more restless, I started to just wander about the abundant Case subdomains. It was on this little adventure that I discovered the awesome EECS site. Now, obviously I had to know that when you come to a school like Case, and you major in Computer Science, your going to have some of this awesomeness, but there were bounties! In short, all the dumb stuff that I do to avoid homework could get me paid! (assuming of course the bounty site isn’t horribly out of date, which I’m praying its not).

As my horribly boring and groggy day wore on, I continued to explore the vast reaches of the Internet until I stumbled across something nifty that I saw some cool POC code for about 6 months ago. Zimbra has definitely started to flesh out and hit its stride. So much so that I had to try and integrate it with the Case network, so that theres that slim chance that it will replace our current… errr…. *system*. (Its plenty functional I guess, but its just sooooo ugly and completely anti-user.) Anyways, I digress. What I did accomplish was a compiled copy of Zimbra, and one that works (as long as your ok bsing the MX recorded needed. ) You can play around with it here, but at the moment it locks most everyone out, since I don’t have the permissions needed to bind to the Case LDAP servers, I sent an e-mail as I was instructed to on the wiki, so hopefully we can get some real stress testing or something.

Speaking of the Wiki, I wrote my first real article in it. Granted, its about myself, but still I contributed, and have a bunch of other articles I wanna write. I’m just a little spoiled by MoinMoin, so the MediaWiki syntax was a bit of a shock to come back to.

Anyways, I’ll be boring myself even more, as the fever continues to rise.

Technorati Tags: , , , , , , ,

powered by performancing firefox