[alicebot-general] Comma's in <that>

R. Vince rvince99 at earthlink.net
Tue Jan 30 06:40:58 PST 2007


Yes! There are POS taggers, but, it is the resolution of ambiguity via 
context after the initial tagging, to correctly tag, that is the challenge, 
and has consumed uncountable hours of my time! -Ralph Vince

----- Original Message ----- 
From: "Noel Bush" <noel at aitools.org>
To: "Alicebot and AIML General Discussion" 
<alicebot-general at list.alicebot.org>
Sent: Tuesday, January 30, 2007 9:19 AM
Subject: Re: [alicebot-general] Comma's in <that>


> Now if it were only possible to accurately identify (or even define, in
> some cases) "parts of speech"..... :-)
>
> R. Vince wrote:
>> Though not part of the AIML spec (and I am mentioneing this solely for 
>> the
>> sake of some future coder faced with this problem, and how I resolved 
>> it), I
>> tag the tokens with Part of Speech Tags as a pre-stage as part of the
>> Normalizing process, before Normalizing. If the POS's on either side of 
>> the
>> comma are the same (e.g. Jack/NP ,/CC Jill/NP) I convert the comma into 
>> the
>> coordingating conjunction 'and' (thus producing Jack/NP and/CC Jill/NP).
>>
>> On the other hand, if the POS's on either side of the comma are NOT the
>> same, I remove the comma (e.g. "In/CD the/DT end/NN ,/CC no/DT one/NN
>> cares/VBZ" thus becomes ("in/CD the/DT end/NN no/DT one/NN cares/VBZ").
>>
>> Essentially, if you wanted, a semicolon could be treated the same way. 
>> But
>> again, this is off-spec. I simply have incorporated it because I am 
>> working
>> on parsing text via AIML as a front end to a larger system.
>>
>> -Ralph Vince
>>
>> ----- Original Message ----- 
>> From: "mehri" <foreverlinux at yahoo.com>
>> To: <alicebot-general at list.alicebot.org>
>> Sent: Monday, January 29, 2007 11:20 PM
>> Subject: [alicebot-general] Comma's in <that>
>>
>>
>>> I ran into a particular issue.
>>>
>>> Some AIML files are written with comma's in <that> and
>>> some are not.
>>>
>>> For example,
>>>
>>> <that>PRESS 1 TO KNOW WHAT THE MOUSE EATS, AND TO
>>> DISCOVER ITS COLOUR</that>
>>>
>>> and some are:
>>>
>>> <that>PRESS 1 TO KNOW WHAT THE MOUSE EATS AND TO
>>> DISCOVER ITS COLOUR</that>
>>>
>>> I do believe that <that>'s shouldn't contain comma's
>>> or other punctuation.
>>>
>>> Is that true?
>>>
>>> I didn't see anywhere in the spec pointing to this
>>> specifically, unless I missed something again in the
>>> specification :-P
>>>
>>>
>>>
>>>
>>> __________________________________________________
>>> Do You Yahoo!?
>>> Tired of spam?  Yahoo! Mail has the best spam protection around
>>> http://mail.yahoo.com
>>> _______________________________________________
>>> This is the alicebot-general mailing list
>>> Reply to alicebot-general at list.alicebot.org
>>> Unsubscribe and change preferences at
>>> http://list.alicebot.org/mailman/listinfo/alicebot-general
>>> Learn netiquette at http://www.dtcc.edu/cs/rfc1855.html
>>> Learn to read at http://www.literacy.org/
>>
>> _______________________________________________
>> This is the alicebot-general mailing list
>> Reply to alicebot-general at list.alicebot.org
>> Unsubscribe and change preferences at 
>> http://list.alicebot.org/mailman/listinfo/alicebot-general
>> Learn netiquette at http://www.dtcc.edu/cs/rfc1855.html
>> Learn to read at http://www.literacy.org/
> _______________________________________________
> This is the alicebot-general mailing list
> Reply to alicebot-general at list.alicebot.org
> Unsubscribe and change preferences at 
> http://list.alicebot.org/mailman/listinfo/alicebot-general
> Learn netiquette at http://www.dtcc.edu/cs/rfc1855.html
> Learn to read at http://www.literacy.org/ 



More information about the alicebot-general mailing list