Shoutbox

Request: word list(s) - Printable Version

-Shoutbox (https://shoutbox.menthix.net)
+-- Forum: MsgHelp Archive (/forumdisplay.php?fid=58)
+--- Forum: General (/forumdisplay.php?fid=11)
+---- Forum: General Chit Chat (/forumdisplay.php?fid=14)
+----- Thread: Request: word list(s) (/showthread.php?tid=91356)

Request: word list(s) by SmokingCookie on 07-06-2009 at 02:37 PM

Hi folks,

I know that there are plenty of word lists on the internet, but I'd like to see your fantasy :)
What I'd like to ask you is: think of as many words as you can, no matter what language, put 'em in a text file and upload the list.
One small point about the language thingy: please give the text file the name of the language's ISO 639-1 code, that is "en.txt" for English, "nl.txt" for Dutch etc.
You can find the ISO codes here in the first column.

Thanks in advance :)


RE: Request: word list(s) by Menthix on 07-06-2009 at 02:44 PM

Just scrape Urban Dictionary for things you won't find on normal lists.


RE: Request: word list(s) by SmokingCookie on 07-06-2009 at 02:47 PM

I've got plenty of English words already, but I'm also looking for words in other languages..

Nice list though ;)


RE: Request: word list(s) by djdannyp on 07-06-2009 at 02:50 PM

What is the purpose of this list?

There are over 1,000,000 words in the English language (and that's not including slang & depreciated words)

Might help to know what its purpose was :P


RE: RE: Request: word list(s) by SmokingCookie on 07-06-2009 at 02:52 PM

quote:
Originally posted by djdannyp
What is the purpose of this list?

There are over 1,000,000 words in the English language (and that's not including slang & depreciated words)

Might help to know what its purpose was :P

Just a little project :P

[Image: attachment.php?pid=968599]
RE: Request: word list(s) by djdannyp on 07-07-2009 at 06:11 AM

Fair enough.....but FYI:

a) anagramised isn't a word
b) an anagram has to make sense, not just be the letters rearranged into a nonsense word, eg: orchestra = carthorse


RE: Request: word list(s) by Jarrod on 07-07-2009 at 06:32 AM

there is a difference between jumbling and anagraming
the other day i wrote some python to solve the 9 letter word from the target in the SMH
you can also use a similar principle to find real anagrams

python code:
from __future__ import with_statement
def ret_words(letter,words):
    return [item for item in words if letter in item]
ana=raw_input("jumble: ")
with open("words-english.dic",'r') as dic:
    x = set([item for item in [line.strip("\r\n") for line in dic.readlines()] if len(item)==len(ana)])#this only targets words of the right length
a = list(ana)
while a:#this targets words containing the same characters
    x = ret_words(a.pop(),x)
a=sorted(ana)
for item in x:#this matches letter patterns to check if it is in fact an anagram
    if a == sorted(list(item)):
        print item
####################that was the conceptual python

##this is the engineered python
i=raw_input('j: ')
#now the line above and either of the following lines
for item in filter((lambda x: sorted(x) == sorted(i)),[ line.strip("\r\n") for line in open("words-english.dic","r").readlines() if len(line.strip('\r\n'))==len(i)]):print item
#the other solution
for item in [item for item in[line.strip("\r\n") for line in open("words-english.dic","r").readlines() if len(line.strip('\r\n'))==len(i)] if sorted(item)==sorted(i)]:print item

using those filtering principles you should be able to write something to produce anagrams, but if you are trying to jumble words, you could use the code I wrote to generate test cases for my code above
python code:
import random
def shuffled(x):
    random.shuffle(x)
    return x
print "".join(shuffled(list(raw_input("String to jumble: "))))

RE: RE: Request: word list(s) by SmokingCookie on 07-07-2009 at 08:21 AM

quote:
Originally posted by djdannyp
Fair enough.....but FYI:

a) anagramised isn't a word
b) an anagram has to make sense, not just be the letters rearranged into a nonsense word, eg: orchestra = carthorse

b) I know, I'm still working on the algorithm ;)
a) I'll change it ;)
RE: Request: word list(s) by Jarrod on 07-07-2009 at 08:38 AM

quote:
Originally posted by SmokingCookie
I'm still working on the algorithm ;)
my code is just what you need if you are looking at one word anagrams
it will not do longer ones with spaces and if your writing it in Jscript you might need to translate it. for example

<in> listen
<out>>enlist
<out>>listen
<out>>silent
<out>>tinsel
RE: Request: word list(s) by SmokingCookie on 07-07-2009 at 08:41 AM

The "if" part of your first sentence is the problem. I'm trying to have it solve multi-word anagrams