Antville Project

Wednesday, 24. April 2002

Update on /referrers performance

I just played around with the antville database and noticed that it looks like there are no indices at all on the ACCESS table! (Except for the primary key, that is.) For example, simply querying for Access data for one weblog takes 10 seconds:

mysql> select count(*) from ACCESS where WEBLOG_ID=1;
+----------+
| count(*) |
+----------+
|     1614 |
+----------+
1 row in set (10.25 sec)

whereas a similar query takes 0.1 seconds on the TEXT table, which has an index on WEBLOG_ID:

mysql> select count(*) from TEXT where WEBLOG_ID=1;
+----------+
| count(*) |
+----------+
|       69 |
+----------+
1 row in set (0.01 sec)

This suspicion is also consistent with the antville_indexes.sql file which doesn't have any entries for the ACCESS table.

I propose we add indices to the relevant columns of the ACCESS table. If that doesn't help enough, we can still delete older records, but I think if we are able to keep all the access history in DB, that would be even better.

PS: for another cool Ausbaustufe of Referrer-Logs have a look at www.disenchanted.com (via LNGR). This is a per-page referrer tracker that really implements bidirectional links. Interestingly, it seems to also have to solved a performance problem because the referrers page is broken down into smaller portions. I think it wouldn't be too hard to implement this for antville if we added the target URL, or maybe better the target story id, to the ACCESS table and a listReferrers macro to the story prototype... just thinking out loud...

PPS: I just noticed that the table definition for ACCESS has a lot of mediumtext columns:

#----------------------------
# Table structure for ACCESS
#----------------------------

create table ACCESS (
   ID bigint(20) not null auto_increment,
   WEBLOG_ID bigint(20),
   REFERRER mediumtext,
   IP mediumtext,
   URL mediumtext,
   PATH mediumtext,
   `ACTION` mediumtext,
   BROWSER mediumtext,
   `DATE` datetime,
   unique ID (ID)
);

Is this really a good thing performance- and space-wise? wouldn't it be better to have varchar(256) and varchar(512) in most places?

link (18 comments
 

Tuesday, 23. April 2002

dealing with future stories

I committed a patch that makes weblog homepages "ignore" future stories, i.e. stories for future days are ignored in the main story list but can still be reached via calendar.

helma.org

I also checked in a change for the calendar where nbsp-padding is only done for single digit dates (i.e. date of month < 10).

I didn't check the file out to antville.org. Please have a look.

link (one comment
 

Monday, 22. April 2002

there is an error message:

XML Parsing Error: junk after document element Location: www.antville.org Line Number 1, Column 41:Error in application 'antville':

Runtime error Conversion to object unsupported by null
----------------------------------------^

link (one comment
 

"link me"

If I save a story under no certain topic, the "link me"-perma-link will be generated as .../[creationdate]/#####. Whereas if I save a story under a topic, the perma-link will automatically created as .../topics/[topicname]/#####, but the story remains also parallely available under .../[creationdate]/#####.

Question: Is it posible to influence the way a perma-link is created? I think of a setting that always chooses the [creationdate]-version as perma-link, even if a story is related to a topic.

Background: subsequent changes of topic names or story classifications ("topification"?) turn perma-links into dead links.

link (no comments) 
 

Friday, 19. April 2002

/referrers taking too long

I just noticed that the /referrers pages on Antville weblogs take forever to compute, something far beyond 5 seconds. Is there some way we can accelerate this?

The easiest solution (and maybe the best one) would be to kick older records out of the ACCESS table, unless there is a reason to keep them all. Maybe it would also be possible to optimize the ACCESS table some more. Something that is definitely not very good for performance is the

... and REFERRER not like "%" + this.href() + "%" ...

in the SQL query in function weblog.listReferrers_macro()...

link (7 comments
 

Thursday, 18. April 2002

www

is it possible that a site [sitename].antville.org works with the url www.[sitename].antville.org as well? many less experienced internet users believe that a proper cough "internet address" has to start with www.(at least this is what my referrer logs imply.)

link (no comments) 
 

Tuesday, 16. April 2002

mod_webapp & suse

after installation on mac os x was a breeze the same installation fails terribly on suse linux ("Premature packet header end"). as far a i understand the binary for the mod_webapp.co does not work here and it the sources has to be compiled. apparently, i need all the apache sources to do so and i'm too busy for this.

here's the question: does anyone have a compiled mod_webapp.co that works with suse linux on the pc? cheers.

link (15 comments
 

The Antville Server Fund has been a great success. Thanks to everybody who contributed!
online for 8890 Days
last updated: 1/4/11, 10:22 AM
status
Youre not logged in ... Login
menu
October 2025
SunMonTueWedThuFriSat
1234
567891011
12131415161718
19202122232425
262728293031
July
recent
zfuture's house here is zfuture's
house
by zfuture (7/31/03, 2:59 AM)
i understand your concerns however,
i hardly can think of a solution. certainly, if the...
by tobi (7/29/03, 9:47 AM)
Found several more similar sites
listed This is getting to be quite a concern to...
by cobalt123 (7/27/03, 7:56 PM)
Second Post Alert on Referrer
bug livecatz I put this into "help" and now here:...
by cobalt123 (7/26/03, 7:14 PM)
well it's not easy to
find from here, anyway. think we should include a link,...
by tobi (7/24/03, 11:25 AM)
So finally I found
the helma Bugzilla - stupid me.
by mdornseif (7/24/03, 10:28 AM)
clock not that it's particularly
earthshattering but the antclock is running slow by about 15...
by kohlehydrat (7/23/03, 8:25 PM)
but blogosphere.us isn't can't really
be rated as spam can it?
by kohlehydrat (7/23/03, 8:08 PM)
More referrer spam www.webfrost.com
by Irene (7/23/03, 7:55 PM)
How to log skin names
I accessed to console?? Hi, I would like to know...
by winson (7/23/03, 4:12 PM)

Click here to get an XML version of this weblog.

Made with Antville
powered by
Helma Object Publisher