Weird Word List: Bowing to Reality
Thursday, April 12th, 2012 01:25 pm![[personal profile]](https://www.dreamwidth.org/img/silk/identity/user.png)
Edited to Add: I'm seeing people volunteering (thank you!) and asking questions. I also see that we may have a list already to work from. I'm going to be in and out for the rest of the day; I'll try to keep up with the question-answering part and get back to the larger discussion tomorrow morning. So -- not ignoring you if I don't answer right away. Thanks again!
OK. It seems to me, from perusing the previous discussions, that we have lots of volunteers for Harvesting, but not so many folks want to Wrangle. No blame to any of us — you see me saying I got no time to Wrangle Weird Words, right? — it’s a massive job.
Which means that we will need to go, with Great Trepidation on the part of the Luddite Writer, to a software sort. We had several volunteers for this — if whomever is still interested in playing would shout out again, in response to this message, we can all put our heads together to see how best to split up the books.
I will still need at least one, and preferably three Wranglers, because, let’s face it, that’s a BIG pile o’novels over there, and some — I’m looking at you, Crystal books — are harder going than others.
The Wish List now looks like this:
1. Automagicians
2. Wranglers
3. Cabana boy
The goal, one more time: A list of Weird Words (including all “foreign” words, be they Liaden, Terran, Delgadan, Vandese, or etc.), and Names (including ship names, planet names, city names, personal names) for each book. One book = One list. In the order the words appear.
Someone had asked if I also wanted odd combos, such as “brother-cousin” or “close-kin” or “Silain-luthia”. Of the examples given, the only pairing I would want would be “Silain-luthia” because Silain has the possibility of becoming I-dare-not-guess and luthia — is an invented word.
Why do I want this? You guys have been so good about putting up with my fidgeting and fussing over this, you deserve the straight dope.
I want these lists, and in this particular format, for two reasons.
Reason One: The Liaden Pronunciation Guide Steve and I have been talking about Forever.
Reason Two: A bunch of Liaden books are going to be produced as audiobooks, RSN. We have promised — actually, given what happened with poor Mr. Shanks — we have insisted that we will provide pronunciation assistance. It is Reason Two that produces the deadline.
Worries: I am particularly concerned that names stay together — Val Con yos’Phelium, Shan yos’Galan and etc. This is the major need for the Word Wranglers. I’m not so worried about bizarre English words sneaking into the list, because, if they’re that bizarre, they belong on the list.
. . .I think that’s it.
OK — who’s in?
Originally published at Sharon Lee, Writer. You can comment here or there.
no subject
Date: 2012-04-12 06:04 pm (UTC)no subject
Date: 2012-04-12 06:12 pm (UTC)no subject
Date: 2012-04-12 07:38 pm (UTC)Automagician
Date: 2012-04-12 06:16 pm (UTC)I also have a list of all words, sorted alphabetically, which I've used to derived a list of words to be suppressed -- things that are utterly typical in-genre (e.g. hyperdrive, empath), clear typos (e.g. imaginatiion), apparent dialect (e.g. checkin', cutesey, damnfool), apparent omissions from my wordlist (e.g. fatcats, flowerbed), sound-representations (e.g. ahhh, fwuummps), or proper nouns not within the story proper (e.g. baen, bujold.) Given what you say here, the list probably isn't inclusive enough; I should probably have included all compound English words (e.g. betold, bloodprice, blueglow.)
Do you want me to make a second go-round adding to the words to be considered English? Or email you the list of additional English words? Or email you the list of all words and you can make your own selections from there? Or, heck, email you the whole shebang so that if a bus hits me tomorrow you have what I've done to date?
no subject
Date: 2012-04-12 06:18 pm (UTC)In!
Date: 2012-04-12 08:45 pm (UTC)no subject
Date: 2012-04-12 09:03 pm (UTC)Automagician
Date: 2012-04-13 12:31 am (UTC)Again, I'm happy to do this. I can trivially produce a list of non-dictionary words for all books that I have electronic versions of, and the number of books does not appreciably increase my time to do so, which is minutes. (I sent Sharon a list of books that I have handy in text form via email).
It looks like we've added a new requirement, for special words that appear in close proximity to each other, e.g. proper names like Val Con yos'Phelium. That's doable, but a bit trickier. I think I'd just generate the first list of such words and then find all instances where they appear adjacent to each other, and spit them out for manual review.
no subject
Date: 2012-04-13 02:27 am (UTC)I've got database design experience (Access). Assuming ebartley hasn't already finished the whole kaboodle. =)
no subject
Date: 2012-04-13 03:02 am (UTC)no subject
Date: 2012-04-13 04:05 am (UTC)Pronunciation Guide
Date: 2012-04-13 07:03 am (UTC)Anyway, the whole discussion is way over my head.
As always I am in awe of your fabulous commenters and their incredible expertise. I obviously had a very miss spent youth and the rest of my life too where technology is concerned.
Re: Pronunciation Guide
Date: 2012-04-13 11:47 pm (UTC)That Steve and I know...is enough.
Volunteering
Date: 2012-04-13 05:23 pm (UTC)