Free Web Mining tool

VISITaTOR

Clustering and visual presentation of visitor groups
based on access patterns

About

This program is based on the assumption that analysis of web server logfiles can reveal information about the kinds of interests and behaviours of the site visitors.

Most of today's log analyzers can show the number of times a certain page has been requested, how popular it is, and sometimes the clickstreams as well.
While this data is, no doubt, important, there is obviously much more to be mined in logfiles.

I have developed a method of grouping the site visitors according to the pages they request and clustering them step by step into increasingly general (higher-level) groups.

  • The uppermost level contains groups that have few or no elements in common; the lowermost level contains those that almost coincide.
    Thus, the entirety of access patterns is represented by a tree.

    One can draw conclusions regarding the interests of the visitors who make up a group from the content of the pages they visit.

    The logfiles to be analyzed should be taken from a stable period of time. You should exclude accesses by robots and non-informative accesses, like images, stylesheets, subframes etc.

    If the site navigation occurs dynamically, using parameters, the current version of the VISITaTOR program will probably not be of much help to you, because it ignores parameters for the sake of simplicity.

    It is advisable to analyze several series of data from different time periods (during which the site content should be the same) and compare the results.
    If you encounter the same groups, there is reason to believe that they represent reality; if a group is unstable, it is probably accidental.

    The more data you have, the more dependable the results are.
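
The points above can be illustrated with a small Python sketch. It is not the VISITaTOR implementation, whose internals are not described here. It assumes hits arrive as hypothetical (visitor, url, user_agent) tuples, uses illustrative robot and file-type filters, measures the overlap of page sets with the Jaccard index, and merges the two most similar groups at each step, taking the union of their pages as the merged group's pages. All function names, filter lists and the 0.8 threshold are assumptions made for illustration.

```python
from itertools import combinations
from urllib.parse import urlsplit

# Illustrative filters (assumptions); adapt them to the site being analyzed.
SKIPPED_SUFFIXES = (".gif", ".jpg", ".png", ".css", ".js", ".ico")
ROBOT_MARKERS = ("bot", "crawler", "spider")


def pages_by_visitor(hits):
    """Map each visitor to the set of informative pages requested.

    `hits` is an iterable of (visitor, url, user_agent) tuples.
    """
    visits = {}
    for visitor, url, agent in hits:
        if any(marker in agent.lower() for marker in ROBOT_MARKERS):
            continue  # exclude accesses by robots
        path = urlsplit(url).path  # URL parameters are ignored
        if path.lower().endswith(SKIPPED_SUFFIXES):
            continue  # exclude images, stylesheets and similar files
        visits.setdefault(visitor, set()).add(path)
    return visits


def jaccard(a, b):
    """Share of pages two groups have in common (1.0 means identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def build_tree(visits):
    """Merge the two most similar groups until a single root remains.

    Each node is a tuple (members, pages, children).
    """
    nodes = [((v,), frozenset(p), ()) for v, p in sorted(visits.items())]
    while len(nodes) > 1:
        i, j = max(
            combinations(range(len(nodes)), 2),
            key=lambda ij: jaccard(nodes[ij[0]][1], nodes[ij[1]][1]),
        )
        a, b = nodes[i], nodes[j]
        merged = (a[0] + b[0], a[1] | b[1], (a, b))
        nodes = [n for k, n in enumerate(nodes) if k not in (i, j)] + [merged]
    return nodes[0]


def stable_groups(groups_a, groups_b, threshold=0.8):
    """Page sets from period A that reappear, nearly unchanged, in period B."""
    return [
        g for g in groups_a
        if any(jaccard(g, h) >= threshold for h in groups_b)
    ]


hits = [
    ("a", "/news.html?id=1", "Mozilla"),
    ("a", "/sport.html", "Mozilla"),
    ("b", "/news.html", "Mozilla"),
    ("b", "/sport.html", "Mozilla"),
    ("c", "/shop.html", "Mozilla"),
    ("c", "/logo.png", "Mozilla"),
    ("d", "/news.html", "Googlebot/2.1"),
]
root = build_tree(pages_by_visitor(hits))
```

In this toy data, visitors a and b request the same pages and are merged first; visitor c joins them only at the root, where the two groups have no pages in common. Other similarity measures or merge rules are equally possible; this choice is made only to keep the sketch short.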

This method has not yet been perfected and needs further development.
However, the task is far too large for me to continue working on it alone.

Therefore I am interested in proposals for developing this tool, which is rather imperfect so far, into a quality product.

Feel free to contact me if you are interested in developing this tool into a commercial product.

© Natalia Bazhenova 2005