(380) previous ~ index ~ next

To: "David Graff" <graff@unagi.cis.upenn.edu>,
From: "Kareem Darwish" <vze33qqu@verizon.net>
Subject: RE: [from Leah Larkey:] Arabic stemmer
Date: Thu, 27 Jun 2002 12:58:41 -0400

Hello,
I bundled a UTF8 version of the stemmer with the rest of the package. It
is still in the same place:
www.glue.umd.edu/~kareem/research
I did not test it. Please inform me if you have problems.
Kareem

-----Original Message-----
From: owner-trec-project@Glue.umd.edu
[mailto:owner-trec-project@Glue.umd.edu]On Behalf Of David Graff
Sent: Wed, June 26, 2002 12:35 PM
To: tdt-distrib@unagi.cis.upenn.edu
Subject: [from Leah Larkey:] Arabic stemmer



------- Forwarded Message

Date: Wed, 26 Jun 2002 12:20:01 -0400 (EDT)
From: Leah Larkey <larkey@cs.umass.edu>
To: <tdt-distrib@ldc.upenn.edu>
Subject: Arabic stemmer

There is a light Arabic stemmer available through trec, which was written
by Kareem Darwish (kareem@glue.umd.edu)
at Maryland and modified by Leah Larkey at UMass. It is
not the same as the stemmer described in the Larkey, et. al
SIGIR 02 paper, but it is similar.
You can find it at http://www.glue.umd.edu/~kareem/research/
Download the "al-stem" stemmer.

stem_cp1256.pl is the updated stemmer script for windows arabic encoding

Karem is updating the version of the script for unicode utf8, but I
do not see it there at the moment.

I am not on this list, so please cc me if you have any questions

Leah

On Fri, 21 Jun 2002, James Allan wrote:

> Is it true that UMass is sharing an Arabic stemmer? If so, could you
> drop a note to tdt-distrib@ldcu.penn.edu about it?
>
> -- james
>
> ------- Forwarded Message
>
> Date: Fri, 21 Jun 2002 08:00:14 -0400
> From: Jonathan Fiscus <jonathan.fiscus@nist.gov>
> X-Mailer: Mozilla 4.77 [en] (WinNT; U)
> X-Accept-Language: en
> MIME-Version: 1.0
> To: TDT Distrib <tdt-distrib@ldc.upenn.edu>
> Subject: [Fwd: Re: TDT schedule and data]
> Content-Type: text/plain; charset=us-ascii
> Content-Transfer-Encoding: 7bit
> Sender: owner-tdt-distrib@linc.cis.upenn.edu
> Precedence: bulk
>
> Can anyone help Nianli?
>
> - -------- Original Message --------
> From: Nian Li Ma <manianli@cs.cmu.edu>
> Subject: Re: TDT schedule and data
> To: Jonathan Fiscus <jonathan.fiscus@nist.gov>, manianli@cs.cmu.edu
>
> Hi, Jon,
>
> Thanks for your reply.
>
> I do have a problem need your help. Now I am working on TDT tracking
> task. But till now
> I can not find suitable stemmer for Arabic data. Could you give me some
> suggestion?
>
> Thank you in advance!
>
> Best,
> Nianli Ma
> - -------------------------------------------------------------
> To unsubscribe from tdt-distrib, email majordomo@ldc.upenn.edu
> with "unsubscribe tdt-distrib" in the body of the message.
>
> ------- End of Forwarded Message
>
>

- -----
Leah Larkey larkey@cs.umass.edu

------- End of Forwarded Message



-------------------------------------------------------------
To unsubscribe from tdt-distrib, email majordomo@ldc.upenn.edu
with "unsubscribe tdt-distrib" in the body of the message.
(380) previous ~ index ~ next

Last updated Fri Jul 5 11:24:27 2002