Date: Thu, 13 Nov 2003 10:32:37 -0600
From: "Martin A. Brown" <martin-tldp@wonderfrog.net>
X-X-Sender: mabrown@copula.wonderfrog.net
To: Alex Weeks <weeks_alex@yahoo.com>
cc: "discuss@en.tldp.org" <discuss@en.tldp.org>
In-Reply-To: <20031113162628.12580.qmail@web60204.mail.yahoo.com>
Message-ID: <Pine.LNX.4.58.0311131025500.19550@copula.wonderfrog.net>
References: <20031113162628.12580.qmail@web60204.mail.yahoo.com>
MIME-Version: 1.0
Subject: Re: Spellchecking tips.
Content-Type: TEXT/PLAIN; charset=US-ASCII

Hello there,

: > Redhat 8 by any chance?
: Redhat 9

Red Hat 9 uses en_US.UTF-8 as the default locale. I would bet
that you should be able to use the following (apologies to those in
other locales, although a similar trick should work):

$ LC_ALL=en_US aspell --mode=sgml check <filename>

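This should greatly improve your speed, most likely because the
tools can then skip multibyte (UTF-8) character handling. Compare
the same grep run under each locale: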

$ LC_ALL=en_US.UTF-8 time grep something rsync-status.txt
Command exited with non-zero status 1
1.35user 0.02system 0:01.45elapsed 94%CPU (0avgtext+0avgdata 0maxresident)k
0inputs+0outputs (155major+31minor)pagefaults 0swaps
$ LC_ALL=en_US time grep something rsync-status.txt
Command exited with non-zero status 1
0.01user 0.02system 0:00.03elapsed 96%CPU (0avgtext+0avgdata 0maxresident)k
0inputs+0outputs (137major+24minor)pagefaults 0swaps

Note the difference in elapsed time: 1.45 seconds under en_US.UTF-8
versus 0.03 seconds under en_US.
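
For those in other locales, the name to put in LC_ALL can probably be
derived from the current setting by dropping the ".UTF-8" part. The
following is only a minimal sketch of that idea: strip_codeset is a
hypothetical helper of my own naming, and it assumes the resulting
locale (for example de_DE) is actually installed, which "locale -a"
should confirm.

```shell
#!/bin/sh
# strip_codeset is a hypothetical helper (not part of aspell or grep):
# it drops the ".codeset" part of a locale name, e.g. de_DE.UTF-8 -> de_DE,
# and keeps any "@modifier" suffix.
strip_codeset() {
    name=$1
    modifier=
    case $name in
        *@*) modifier=@${name#*@}; name=${name%%@*} ;;
    esac
    printf '%s%s\n' "${name%%.*}" "$modifier"
}

# Locale in effect for character handling: LC_ALL wins, then LC_CTYPE,
# then LANG; fall back to C if none of them is set.
current=${LC_ALL:-${LC_CTYPE:-${LANG:-C}}}
legacy=$(strip_codeset "$current")

echo "current locale: $current"
echo "try instead:    LC_ALL=$legacy aspell --mode=sgml check <filename>"
```

The script only prints a suggested command; it does not change any
setting or run aspell itself.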

If you wish to keep your entire system in en_US (non-Unicode), you
can change /etc/sysconfig/i18n to something like this:

LANG="en_US"
GDM_LANG="$LANG"
# -- altered 2003-10-22; -MAB
#
SUPPORTED="en_US.UTF-8:en_US:en"
SYSFONT="latarcyrheb-sun16"
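
If you would rather script that change than edit the file by hand,
something along these lines might do. It is a sketch only: set_lang is
a hypothetical helper name, it touches nothing but the LANG= line, and
it assumes the file has the simple KEY="value" layout shown above.

```shell
#!/bin/sh
# set_lang is a hypothetical helper: rewrite the LANG= line of an
# i18n-style file, keeping the previous contents as FILE.bak.
# The pattern is anchored at the start of the line, so GDM_LANG= is untouched.
set_lang() {
    file=$1
    lang=$2
    cp "$file" "$file.bak" || return 1
    sed "s|^LANG=.*|LANG=\"$lang\"|" "$file.bak" > "$file"
}

# Example (as root, on the real file):
#   set_lang /etc/sysconfig/i18n en_US
```

The new value should only take effect for sessions started after the
change, and the .bak copy is there in case you want to go back.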

Good luck,

-Martin

______________________
http://lists.tldp.org/