| What is Language ? |
|
Since I have been able to read I have been fascinated by
the idea of words. As a child I pondered how a dictionary could
contain "all the words in the language". As an adult I
marvel at the complexities and innateness behind Language.
|
|
As someone who works closely with computers, I enjoy the promise
of what computers can bring to our understanding of ourselves
and our use of Language. To that end, I dedicate my time and effort.
The technologies that provide the conglommerate known now as "The Internet"
will continue to evolve. Today's Internet provides too little organization
and understanding behind much of the information that is presented. This will change
as Natural Language heuristics are run on computers that "read" and "organize" information.
|
|
My Project Gutenberg data load is 14575 books approximately 1 billion tokens.
|
News: April 2007 -
I have completed my Masters Thesis entitled Generating and Rendering String Frequency Measurements of Project Gutenberg Texts. Interested parties are encouraged to contact me for a copy.
|
April 2006 - My most recent efforts
in this area have included analysis and metadata creation for 15022 books
(8.3 billion tokens)
from Project Gutenberg.
This paper describes the effort.
|