See also: http://sourceforge.net/patch/?group_id=103
Gergely Madarasz gorgo@caesar.elte.hu
Mon, 6 Sep 1999 12:30:00 +0200 (METDST)
On Sat, 4 Sep 1999, Grace, Terry wrote:
> I'm using htDig to index my list archives. Works great
for the
> publicly accessible archives but since it locates documents
via
> spidering, the only way to index the private archives is to
turn them
> into public archives.
>
> My thought to get around this is on the private archives is
to have
> htDig replace /pipermail/ with /mailman/private/ in the url.
Easily
> done but in order for it to work via a script I need to have
a method
> of turning the list public while it's indexing and then back
to
> private when it's finished.
>
> Anyone have any ideas?
use the local_urls: option in your htdig.conf
Les Niles les@2pi.com
Wed, 22 Sep 1999 14:41:09 -0700
Wilma is a search engine and web interface built on top of
MHonArc
and glimpse. (ftp://ftp.hpc.uh.edu/pub/wilma/)
It works nicely
with mailman. I set it up to build its archive in the usual
mailman/archives/private/list_name/ directory, fetching the
messages from the list_name.mbox/list_name.mbox file.
(Actually,
a link to that file with a month-specific name, since I've
got
wilma set up to make month-by-month archives.) I haven't
delved
into how pipermail is wired into mailman so I just left it
all
there; there's no problem with it and wilma stepping on each
other,
as long as wilma is configured to ignore the pipermail HTML
archives.
When users access the archives they do so through the
"private" cgi
program, via a URL something like
http://mysite.com/mailman/private/list_name/9909/
(for the Sept 1999
archives), which means that mailman's password security mechanism
gets
invoked just like for the pipermail archives. It worked
surprisingly
easily.
Igor S. Livshits igorl@life.uiuc.edu
Fri, 20 Aug 1999 17:34:23 -0500
Hi,
Here's a short patch that will allow one to add a search block
to the
index.html pages created by the Mailman archiver:
-=>diff HyperArch.py HyperArch.py.orig
372d371
< %(search)s
525,540d523
<
< # Provision for archive searching
< try:
< search_file_path= os.path.join(mm_cfg.PREFIX,
< 'archives/private',
< self.maillist._internal_name,
< 'search.html')
< search= open(search_file_path, 'r')
< except IOError:
< d["search"] = '''\
< <!-- searching disabled -->
< '''
< else:
< d["search"] = search.read()
< search.close()
<
Simply patch HyperArch.py and add a search.html file to
your
~mailman/archives/private/<listname>/ directory. As the
archive gets
rebuilt, the contents of the search.html file will get included
just
above the index table of the index.html file in that
directory.
This provides a fairly painless way to add an interface for
ht://Dig
searches and similar.
This is my first attempt at Python code; so, if there is a
better way
to accomplish the same, I'd love to hear about it.
Nigel Metheringham Nigel.Metheringham@VData.co.uk
Fri, 08 Sep 2000 09:48:26 +0100
In contrast to Mr Lawrence I am using the inbuilt pipermail
archiver
with htdig, which seems work passably well.
You *do* want to add this patch which prevents the indexer
indexing the
tables of contents and other cruft which then gives you lost of
false
positives on searches.
http://sourceforge.net/patch/index.php?func=detailpatch&patch_id=100867
&group_id=103
An example of this form of configuration can be seen at
http://www.exim.org/
- follow the Mailing Lists link
http://www.exim.org/maillist.html
Jack Valko jack@valko.org
Sun, 27 Aug 2000 00:55:37 -0700
I've created a HOWTO on including SWISH-E searching into list
archives.
It's available at http://www.valko.org/mailmanswish-e/.
The changes are
rolled into my 1.1 installation, and I'll upgrade it to 2.0 as
soon as
it comes out of beta.
Drop me a line if you have comments or questions,
<html>
<head>
<title>ht://Dig WWW Search</title>
</head>
<body bgcolor="#eef7ff">
<h1><a href="http://www.htdig.org"><IMG
SRC="/htdig/htdig.gif"
align=bottom alt="ht://Dig" border=0></a> WWW Site
Search</H1>
<hr noshade size=4>
This search will allow you to search the contents of
all the publicly available WWW documents at this site.
To obtain help in searching, click <a
href="searchhelp.html">here</a>.
To see all that has been searched before, click
<a href="/cgi-bin/htdig.log">here</a>.
<br>
<p><form method="post" action="/cgi-bin/htsearch">
<font size-1>Match:
<select name=method>
<option value=and>All
<option value=or>Any
<option value=boolean>Boolean
</select>
Format:
<select name=format>
<option value=builtin-long>Long
<option value=builtin-short>Short
</select>
Sort by:
<select name=sort>
<option value=score>Score
<option value=time>Time
<option value=title>Title
<option value=revscore>Reverse Score
<option value=revtime>Reverse Time
<option value=revtitle>Reverse Title
</select>
Which site:
<select name=restrict>
<option value="">All Eskom Web Servers
<option value="http://www.eskom.co.za/">External
Web Site
<option value="http://intranet.eskom.co.za/">Intranet
Web Server
<option value="http://lists.eskom.co.za/pipermail">ESKOM
Mailing List Archives
</select>
<br>
Matches per page:
<select name=matchesperpage>
<option value=10>Ten
<option value=20>Twenty
<option value=30>Thirty
<option value=40>Forty
<option value=50>Fifty
</select>
<p></font>
<input type=hidden name=config value=htdig>
<input type=hidden name=exclude value="">
<br>
Search:
<input type="text" size="30" name="words" value="">
<input type="submit" value="Search">
</form>
<hr noshade size=4>
For any queries regarding the search engine, e-mail
the <a href="mailto:richarde@eskom.co.za">searchmaster</a>.
</body>
</html>