Antville Project

Backlinks

I just deleted over a million worth of backlinks from our database, namely all becklinks from 2002. Our policy of keeping per-story backlinks forever just did not work out - the accesslog table is (or was) one of our biggest headaches. It causes a lot of the slow queries for MySQL, and when I checked I found it was corrupt again today. I think in the future we'll have to adopt a different policy of keeping story backlinks for a month or two, max.

During the time MySQL took to delete the records, access to individual stories timed out because the the table was locked. Sorry for that outage and the ones before which were caused by other db maintainance work.

comment    

 
michi, February 1, 2003 at 2:38:03 PM CET

Re: Backlinks

well, i always wondered that there is something like an automated cleanup mechanism built into antville, which automatically deletes old accesslog entries, but that it is not used here. lets keep the fingers crossed, that this was it! another performance issue might be the storiesByID collection, which is only used for the bloggerapi, and therefore could be easily substituted with a direct database-query. (having that said, i will try to rewrite this part and submit it to the CVS if that's fine with you guys)

link  

 
michi, February 1, 2003 at 3:43:14 PM CET

Re: Re: Backlinks

...having that said, i already rewrote that part, and tested it. just give me the OK, and i'll submit it.

link  

 
hns, February 1, 2003 at 5:36:17 PM CET

Re: Re: Backlinks

Does the storiesByID collection load its subnode index when a story is accessed?

What does your solution look like? It's hard to say OK/NOK to something I haven't seen.

link  

 
michi, February 1, 2003 at 6:49:54 PM CET

Re: Re: Backlinks

basically this is it: function getStoryByID(storyID) { var sql = "select AV_SITE.SITE_ALIAS from AV_TEXT, AV_SITE where AV_TEXT.TEXT_ID = "+storyID+" AND AV_TEXT.TEXT_PROTOTYPE = 'story' AND AV_TEXT.TEXT_F_SITE=AV_SITE.SITE_ID"; var dbcon = getDBConnection ("antville"); var dbres = dbcon.executeRetrieval(sql); if (dbres && dbres.next()) { var siteAlias = dbres.getColumnItem (1); if (root.get(siteAlias)) { var st = root.get(siteAlias).stories.get(storyID); } } dbres.release(); return st; } regarding your first question: i don't know. it's just that as far as i know/experienced large collections are "bad", and small ones are "good" :-) adding new items to very large collections used to take quite some time (at least in the past; maybe this has changed anyways)

link  

 
hns, February 2, 2003 at 3:24:33 PM CET

Re: Re: Backlinks

I'd prefer to find out if there is a problem before fixing it. Shouldn't be too hard with logSql enabled.

link  


... comment
 
nosleep, February 8, 2003 at 11:03:49 PM CET

Backlinks suckage

I simply don't understand what you like to archive with the backlinks-tag, it sounds to me like a sysiphos-job because:

  • it works against every cache-strategy to modifiy a story when somebody accesses it with a valid referer.

  • it accumulates lots of data, that is quite useless within a story.

Don't get me wrong, I am not saying to skip it, but maybe its a wise decision to implement it like the /referrer/ on a per story basis like project.antville.org?

link  


... comment


The Antville Server Fund has been a great success. Thanks to everybody who contributed!
online for 8549 Days
last updated: 1/4/11, 10:22 AM
status
Youre not logged in ... Login
menu
November 2024
SunMonTueWedThuFriSat
12
3456789
10111213141516
17181920212223
24252627282930
July
recent
zfuture's house here is zfuture's
house
by zfuture (7/31/03, 2:59 AM)
i understand your concerns however,
i hardly can think of a solution. certainly, if the...
by tobi (7/29/03, 9:47 AM)
Found several more similar sites
listed This is getting to be quite a concern to...
by cobalt123 (7/27/03, 7:56 PM)
Second Post Alert on Referrer
bug livecatz I put this into "help" and now here:...
by cobalt123 (7/26/03, 7:14 PM)
well it's not easy to
find from here, anyway. think we should include a link,...
by tobi (7/24/03, 11:25 AM)
So finally I found
the helma Bugzilla - stupid me.
by mdornseif (7/24/03, 10:28 AM)
clock not that it's particularly
earthshattering but the antclock is running slow by about 15...
by kohlehydrat (7/23/03, 8:25 PM)
but blogosphere.us isn't can't really
be rated as spam can it?
by kohlehydrat (7/23/03, 8:08 PM)
More referrer spam www.webfrost.com
by Irene (7/23/03, 7:55 PM)
How to log skin names
I accessed to console?? Hi, I would like to know...
by winson (7/23/03, 4:12 PM)

Click here to get an XML version of this weblog.

Made with Antville
powered by
Helma Object Publisher