Release notes on Leo Geoffrion
[Skidmore College] web archiving project
Created for the Union - Skidmore - Hamilton - Amherst web archiving project
Sponsored by the Center for Educational Technology.
[Feb.
2, 2003]
I'm starting to think about
creating some of the code to explore the ideas in my paper.  Ideally, I'd
prefer to do it in a form that is readily shared across our schools 
This leads to some technology
questions for our group... 
1.  What Operating system
(e.g. Windows2000, Solaris2, ...) is on your main campus web server? 
2.  Which web server
software (e.g. Apache, IIS, ...) runs your main web server? 
3.  What databases (e.g.
Oracle, mysql, Postgres, ...) are available to your main server? 
4.  What scripting
languages (e.g. Perl, PHP, ColdFusion, ...)  are available for linking
databases and web pages? 
I'd like to see whether we
share a common software core. 
[Feb. 13, 2003]
As I reported previously, I
have started to write some of the code to watch sites for changed information.
You can preview some of the
preliminary pages at   http://ldg.skidmore.edu/archives
You'll find two forms linked
off of there:
  define a new site to
    be watched.
  edit an existing
    site.
Please feel free to test them
out.  I'd be particularly interested in determining whether they include
the appropriate fields for your organizations.  For example, would you need
other information in order for these to work at your school?
These are crude pages with no
surrounding decoration and only partial debugging.
You will probably not be
successful in actually defining a new site since you need to know much specific
information on the file locations of the underlying pages.
Both forms manipulate an
underlying mysql database that contains the following tables:
    
    site                
    basic information about each site
    
    pages            the
    list of watched pages associated with each site
    
    pagedata       the data used to detect if a
    page has changed
    
    status_dict     a dictionary for the status codes
    
    log                 
    event log to track activity
There is a third invisible
component that you don't see on that page. It's the script that scans all files
defined by the above forms.  If any file has changed, it logs the
information into the database.
I next need to write the
scripts to take follow-up actions when the changes are noted.
All of the pages are written in
Perl for a Unix-like environment with mysql as the underlying database.  If
you eventually want the scripts for your campus, you'll need to have your IT
people install the CGI, DBI, and DBD extensions to Perl.
    CGI is
    the cgi extension (often part of the standard Perl installation)
    DBI is
    the common database interface for Perl
    DBD is
    the Perl database driver for MySQL
All are available for download
and installation from cpan.perl.org
The DBI/DBD installation is
moderately messy for some systems.  It took several tries to get them
properly installed on my Linux test box, and I'm still working with our IT
people to get it working cleanly on the main campus server (Solaris).
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Leo D. Geoffrion, Ph.D.                   
email:  ldg@skidmore.edu
Webmaster
Strategic Communications              
phone:  518 580-5735
Skidmore College                             
Fax:  518 580-5748
815 No. Broadway
Saratoga Springs, NY  12866
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
         Return
        to the Archiving Home Page
  Return
        to the Archiving Home Page
        
Sponsored by Union
        College and the Center
        for Educational Technology