Wildcard Secrets Revisited

A few weeks ago I sent out an article called "Three Wildcard Secrets." I thought they were pretty good secrets, too! You can see them here.

In a nutshell, here are the first two:

The wildcard range [A-z], meant to find any uppercase or lowercase letter, will not find accented letters. You have to use [A-Za-z] instead. So I suggested using [!A-z] (not A-z) to find any characters that are accented.

Similarly, if you need to find any unspecified Unicode character, you can use the not range [!^000-^255]. That should work, as 255 is the upper limit on ANSI characters, so anything the range finds must be Unicode.

Then I received a corrective email from macro expert Paul Beverley. The nerve! Here's what Paul had to say about secret #1:

You see the problem? It did what you asked, not what you wanted. It finds any character at all, except A-z.

And Paul is right! The range [!A-z] finds not just accented characters but also spaces, punctuation, and other stuff that isn't letters—something I knew if I'd actually thought about it. You can solve the problem by adding more things that you want to skip. Here's an example:

[!A-z 0-9.,;:\-\?\!^001-^064]

(For more information, see my Wildcard Cookbook for Microsoft Word.)

Next, Paul had this to say about secret #2:

On my PC [!^000-^255] throws up an error:

Now technically, I was right about the range being [!^000-^255]. The problem is that Microsoft Word wants [!^001-^255] instead. And to make things even worse, that wildcard range correctly skips the ASCII characters (numbered 0-126) but incorrectly finds the extended ASCII characters (numbered 127-255), even though we've told it not to. Microsoft strikes again!

But wait, there's more!

  • The range [!^128-^255] gives us the same error message as [!^000-^255].
  • The range [!^127-^255] finds Unicode characters (which it should) and extended ASCII characters (which it should not).
  • The range [!^127-^254] skips extended ASCII characters (which it should) and Unicode characters (which it should not).

All of this weirdness seems to hinge on the points where ASCII becomes extended ASCII, and extended ASCII ends.

Might any of this be useful in your editing work? Yes, if you're using wildcard searches:

  • Use the range [!^127-^255] to find Unicode and extended ASCII characters.
  • Use the range [!^127-^254] to skip Unicode and extended ASCII characters.

That should work, at least until Microsoft decides to fix these problems.

Many thanks to Paul Beverley for his valuable feedback. If you'd like a bunch of free editing macros with instructions on how to use them, you'll want to download Paul's book Macros for Editors.

Three Wildcard Secrets

So, you’ve been using wildcards with Microsoft Word’s Find and Replace feature to save time and ensure consistency as you edit the plethora of freelance work that’s bombarding your inbox. Excellent. But even if you’re getting pretty good at using wildcards, there are actually a few secrets that even many experts don’t know. Maybe you’ll find them useful.

Secret #1

Using the wildcard range [A-z] to find any uppercase or lowercase letter will not find accented letters. Take, for example, the word résumé. [A-z] will find the r, s, u, and m in that word, but it will skip over the two occurrences of é. Go ahead, try it. I’ll be here when you get back.

If you actually want to skip over accented letters, [A-z] might come in handy sometime, but that will rarely be the case. To find all letters, both accented and unaccented, use the wildcard range [A-Za-z] instead. Also, as you might expect, [A-Z] finds both accented and unaccented capital letters, and [a-z] finds both accented and unaccented lowercase letters.

Okay, so [A-z] skips over accented characters; that suggests the idea that we could use a variation on that range to find only accented characters. How? Add an exclamation mark at the beginning of the range, which tells Word to find any character except what is in the range. The modified range looks like this: [!A-z]. Again, using the word résumé, [!A-z] finds the two occurrences of é but skips over r, s, u, and m. If you need to find accented characters, now you know how.

Secret #2

In a wildcard search, you can’t search for Unicode characters using numeric codes (such as ^u945), but you can copy Unicode characters from a document and paste them into Word’s Find box. You can even search for a range of Unicode characters. For example, the range [?-?] from the Greek alphabet (lowercase alpha through omega) finds every character in the Greek greeting ???? ???. You can see the range of characters in various languages by clicking Word’s Insert tab and then the Symbol button on the ribbon:

For example, here, in order, are some of the Cyrillic characters available in Word:

If you need to find any (unspecified) Unicode character in a document, you can use this not range:

[!^000-^255]

That works because 127 is the upper limit on ASCII characters and 255 is the upper limit on ANSI characters. This wildcard string excludes them both, so anything it finds must be Unicode.

Secret #3

You can use the search code ^p to search for paragraph breaks in a regular search, but not with wildcards. If you try, Word will display an error message:

So how do you search for paragraph breaks when using wildcards? Instead of ^p, use ^013. All well and good, but here’s the real secret: In Word’s Replace box, you must not use ^013; instead, use ^p. That’s right:

Find what: ^013

Replace with: ^p

Why? Because Word’s paragraph breaks are not the same thing as an ANSI paragraph break (^013). Word stores all kinds of stuff in paragraph breaks (formatting, for example), and an ANSI break is just a character like any other. I’ve occasionally had to work on documents that came from who-knows-where, and my usual wildcard searches wouldn’t work. I finally realized that those documents were using ANSI paragraph breaks rather than Word’s proprietary paragraph breaks. The solution? Once again:

Find what: ^013

Replace with: ^p

After I replaced the ANSI breaks, my wildcard searches worked again.

Bonus Secret

Microsoft Word’s wildcard search engine uses a modified version of what is widely known as RegEx—short for “regular expressions.” RegEx is more powerful than Word’s version of it, but if you’re feeling geeky, you can actually use RegEx in Word—not in Word’s Find and Replace dialogs but in a macro. If you’re interested, you can learn more here:

http://www.regular-expressions.info/vb.html

I hope you find these wildcard secrets useful. If you’re just getting started with wildcards, please check out my in-depth book on the subject, Wildcard Cookbook for Microsoft Word (ISBN 978-1-4341-0398-7), available at Amazon, Barnes & Noble, and other purveyors of fine technical literature.

Review: Geoff Hart’s Write Faster with Your Word Processor

I've been a fan of Geoff Hart’s books and articles since 2007, when I reviewed the first edition of his book Effective Onscreen Editing. The beauty of that book is that it applies to any software an editor might use. As Geoff explained then, “The overall goal is to teach editing strategies, not specific software.” However, in his latest book, Write Faster with Your Word Processor, Geoff changes his approach in three ways:

  • The book is aimed primarily at writers rather than editors, including writers of fiction. (Geoff is well known for his technical writing, but he’s also the author of several novels and short stories.)
  • The book focuses mainly on writing with Microsoft Word. Why? As Geoff explains, “I’ve provided Word-specific examples because most writers use Word. Moreover, I’ve learned from teaching many workshops that it’s necessary to make general strategies concrete, and Word does a great job of showing how to implement those strategies.”
  • The book isn’t as concerned with being effective as it is with being fast. It’s packed with useful tips and advice to help you spend less time fighting with your computer and more time actually writing. Geoff promises, “I’ll teach you how to improve your existing skills and learn new ones. As you master these skills, you’ll find yourself focusing more on the craft of writing and less on the tools themselves. That means you’ll write better and faster, with less need for revision.”

Like all of Geoff’s books, this one is thorough—I mean really thorough.

Part 1, “Get started,” explains how to personalize your computer to fit the way you work, with an emphasis on something many writers overlook: hardware. Geoff covers:

  • Choosing a good monitor.
  • Choosing a good keyboard.
  • Choosing a good mouse.

I can’t emphasize enough how important these are. But Geoff takes all of this a step further, covering your computer’s overall behavior, keyboard settings, mouse settings, language settings, and display settings. Then he explains how to organize your files, part of the book I'll be reading in more depth (not that I have any problems with organization). Finally, he talks about developing safeguards: security considerations, backing up your work, updating software, protecting your work with passwords, and protecting yourself from computer-related injury and other problems.

Part 2, “Write your first draft,” focuses on getting your words out of your head and into your word processor, again with an emphasis on speed. “Write first, edit later,” as the saying goes. Geoff explains in detail how to develop and use a strong outline (with a nod to those who prefer not to). Then he covers Microsoft Word’s features that are especially useful for writing (including some you probably don’t know about) as well as settings that you might want to change. Out of the box, Microsoft Word is set up to produce business memos and family newsletters; it is definitely not set up for serious writing. But the beauty of Word is that it’s practically infinitely customizable, so why not turn it into a lean, mean writing machine? Here, Geoff explains how.

I’m not going to go into much more detail about what the book includes; you’ll find a detailed table of contents on Geoff's website. I’ll just say that part 3 explains how to revise your writing once you’ve got it down (now it’s time to edit), and part 4 includes more detailed information and resources to help you back up your work and avoid stress injury, as well as a list of helpful keyboard shortcuts. The book ends with a glossary of publishing terms, a link to an online bibliography for those who want more information about a particular area, a collection of helpful internet resources, and an index that, like the rest of the book, is amazingly thorough.

Geoff writes, “My goal is NOT to teach you the writer’s craft; there are many better books for that purpose. The goal is NOT to teach you how Microsoft Word works; Word is just one of many alternatives you can use.” What, then, is the book’s purpose? “My goal,” Geoff says, “is to teach you how to write using a word processor.” And in that, he succeeds beautifully.

Write Faster With Your Word Processor is one of the most comprehensive books I’ve seen about how to write on the computer. At 352 pages (548 for the PDF version, which includes screen shots), it’s not for the faint of heart. But, as Geoff says, “I’ve provided the information in small chunks, designed for easy reading and browsing. You can dip into the book to solve a specific problem, or read it a chapter at a time to increase your mastery.” I’ll be keeping the book near at hand for those very purposes, and I recommend that you do the same.

Geoff makes it easy to buy the book. You can learn more here.

Bibliographic information: Hart, G. 2021. Write Faster With Your Word Processor. Diaskeuasis Publishing, Pointe-Claire, Quebec.

  • Printed version: 352 pages, including index. ISBN 978-1-927972-29-8
  • PDF version (suitable for most tablet computers and very large phones): 548 pages, including index. ISBN 978-1-927972-30-4
  • EPUB version: (unpaginated) ISBN 978-1-927972-31-1

Review: Geoff Hart’s Effective Onscreen Editing, 3rd Edition

Back in 2007, I reviewed the first edition of Geoff Hart’s book Effective Onscreen Editing, which I still keep close at hand on my bookshelf. Why? Because it’s one of the best books ever written about how to edit on a computer, packed with real-world information you’ll find nowhere else.

My own books (such as Wildcard Cookbook for Microsoft Word) are related specifically to a certain piece of software, but the beauty of Geoff’s book is that it applies to any software an editor might use, on either Macintosh or PC (or a Linux box, for that matter). As Geoff explains, “The overall goal is to teach editing strategies, not specific software.” And in that, the book succeeds admirably.

Geoff covers all of the essentials a working editor needs to consider, including the kind of technical matters that most interest me:

  • Personalizing your software
  • Navigating and selecting text
  • Inserting and deleting text
  • Tracking revisions
  • Using comments
  • Using search tools to improve consistency
  • Developing style sheets
  • Using spelling and grammar checkers
  • Automating editing tasks

He also covers business matters, such as:

  • Determining your pay rate
  • Negotiating with clients
  • Dealing with contracts

But Geoff also addresses the ever-important human side of the editing equation, discussing such matters as:

  • Encouraging dialogue and communication
  • Security and confidentiality
  • E-mail alternatives
  • Avoiding repetitive-stress injury and other physical problems

In this new edition, Geoff has added important information about recent developments that I’ve seen covered nowhere else:

  • Collaborative, real-time editing over the Web.
  • Important new software tools for imposing consistency and applying publisher style requirements.
  • Proofreading PDF files, online documents, and Web pages.

You’ll find more information about the new edition here. And you’ll find a detailed table of contents here.

If you’re a working editor, you owe it to yourself to buy and read this book, which is available in print, as a nicely formatted and hyperlinked PDF, and as an ebook (free with the PDF) for use with small tablets and smartphones. I make no money from the sale of this book; I'm just an ardent fan. As I said of the first edition, I give it my highest recommendation. And besides, as an excellent writer and a respected teacher known for generously sharing his expertise, Geoff deserves your support. You can purchase the book here.

Many thanks to Geoff for creating this wonderful resource.

Hart, G. 2016. Effective onscreen editing: new tools for an old profession. 3rd ed. Diaskeuasis Publishing, Pointe-Claire, Quebec.
Printed version: 518 p. ISBN 978-1-927972-04-5
PDF version (suitable for most table computers): 827 p. ISBN 978-1-927972-05-2
EPUB version: (unpaginated) ISBN 978-1-927972-06-9

 

Lyonizing Word: Before Typesetting

by Jack Lyon

I need your help, Gentle Reader. I need your ideas. Back in 1996, when I started selling Microsoft Word add-ins at the Editorium, getting a Word document into QuarkXPress was tricky: Quark was prone to crashes and didn’t handle footnotes at all. To solve these problems, I created QuarkConverter, and NoteStripper. A few years later, when people started switching to InDesign, I created InDesignConverter.

In the past several years, however, both QuarkXPress and InDesign have become much better at importing Word documents directly, without the need for a converter. The crashes are mostly gone, and footnotes come right on in. Nevertheless, I’m wondering what else might be done to a Word document to save time and trouble when importing into a layout program — and I’d greatly appreciate your thoughts about that. Here are some examples of the kind of thing I have in mind:

  • Add nonbreaking spaces to dates and initials.

For example, if the text includes a date like “August 17, 2016,” most typesetters want “August” and “17” to stay together; adding a nonbreaking space between the two elements does the trick. Similarly, if a name like “C. S. Lewis” shows up, it’s nice to keep the “C.” and the “S.” together. (To add a nonbreaking space in Word [Windows] 2007 and newer, hold down the CTRL and SHIFT keys as you press the spacebar. For Word [Mac], press the Option key as you press the spacebar.)

  • Remove formatting “overrides.”

Typesetters typically want to handle formatting with styles, so that changing a style attribute in InDesign automatically changes formatting throughout the document. If an author or editor has applied styles in a Word document, those styles can be imported and used in InDesign. But if an author or editor has applied direct formatting using various fonts, that formatting will be imported as “overrides” on the text, which can be a bit of a pain to clean up.

Override Options

Override Options

In its Styles pane, Microsoft Word offers to “Clear All” formatting and styles from selected text.

Clear All Option

Clear All Option

The problem is, “Clear All” really does mean “Clear All,” including not just font overrides but also such local formatting as bold and italic, which needs to remain intact. InDesign’s “Clear Overrides” feature has the same problem. Do you really want to remove italic formatting from the hundreds of journal titles in that giant manuscript you’re editing? If you’re proofreading or setting type, do you really want to put all that formatting back in again by hand? My FileCleaner add-in includes an often-overlooked feature (“standardize font formats”) that removes font overrides but leaves bold, italic, and other local formatting intact, which is exactly what’s needed.

Standardize Font Formats Option

Standardize Font Formats Option

  • Turn straight quotation marks into curly ones.

InDesign can do this—sort of. But it can’t handle things like “’Twas the night before Christmas” or “A miner, ’49er” (dreadful sorry, Clementine). FileCleaner does a much better job of dealing with this; it properly handles ’til, ’tis, ’tisn’t, ’twas, ’twasn’t, ’twould, ’twouldn’t, and ’em, as well as single quotation marks in front of numbers, all of which then come into InDesign correctly. If you have other items that should be included in this list, I’d love to know what they are.

  • Remove multiple spaces between sentences.

In the 1800s many books were set with extra space between sentences.

Sample of 1800s Typeset Page

Sample of 1800s Typeset Page

But, frankly, the 1800s were not exactly the golden age of typesetting.

1800s Poster

1800s Poster

Modern books include just one space between sentences. Still, many authors continue to use two, following the instructions they were given by their high-school typing teacher back in the twentieth century. And that means the double spaces need to be removed at some point. InDesign has built-in find-and-replace routines that will fix this and a few similar items.

InDesign Find & Replace

InDesign Find & Replace

FileCleaner, however, fixes many such things. And the version that’s included with Editor’s ToolKit Plus 2014 fixes many more.

FileCleaner Options

FileCleaner Options

  • Change italic and bold formatting to character styles.

Using character styles in InDesign provides much more stability and flexibility than local bold and italic formatting. It would be nice to have these styles already applied in Word before the document is imported into InDesign. My tools don’t currently do this, but they probably should.

QuarkConverter and InDesignConverter include some other useful fixes.

Quark Converter Options

Quark Converter Options

 

InDesign Converter Options

InDesign Converter Options

Nevertheless, I can’t help thinking that there must be things I’ve overlooked. I’m an editor, not a typesetter, so I don’t really know all of the things that typesetters have to fix that they really shouldn’t have to deal with. (This probably includes the most common errors that proofreaders mark.) So if you do typesetting or proofreading, would you help me out? I’d really like to know what I’m missing — things that could be cleaned up in an automated way in Microsoft Word before a document is ever imported into InDesign. What problems do you routinely encounter that you wish would go away? If you’ll let me know, I’ll try to come up with an add-in designed specifically to fix such things. Your suggestions for this would be most welcome.

Of course, typesetters and proofreaders aren’t the only ones who can benefit from this kind of cleanup. It’s also valuable to editors, allowing them to focus on words, structure, and meaning rather than deal with these tiny but pervasive problems. Little things like double spaces and straight quotation marks may not seem all that bothersome, but like pebbles in your shoe, they create subliminal annoyance that really adds up, making editing much more difficult than it should be. At least that’s my experience. What do you think?

Jack Lyon (editor@editorium.com) owns and operates the Editorium, which provides macros and information to help editors and publishers do mundane tasks quickly and efficiently. He is the author of Microsoft Word for Publishing Professionals, Wildcard Cookbook for Microsoft Word, and of Macro Cookbook for Microsoft Word. Both books will help you learn more about macros and how to use them.

Lyonizing Word: Inside Notes

by Jack Lyon

As useful as they are, Microsoft Word’s footnotes and endnotes are amazingly easy to mess up. Let’s look at some ways that can happen — and how to fix the problems.

First, we need to open a document that has footnotes — or make one. Then, to really see what’s going on, we’ll do this:

  1. Click “View” and then “Draft.”

Click "View" then "Draft"

Click "View" then "Draft"

2. Click “References” and then “Show Notes.”

Click “References” and then “Show Notes”

Click “References” and then “Show Notes”

That should take you into Word’s “Notes Pane,” which should look something like this:

Word’s “Notes Pane"

Word’s “Notes Pane"

Deleted Reference Numbers

The superscript numbers in front of each note are called reference numbers. By default, they’re formatted with a character style — either Footnote Reference or Endnote Reference, which you can modify if necessary. What’s interesting about these numbers is that it’s possible to delete them, so the notes look like this:

Deleting Note Numbers

Deleting Note Numbers

Deleting them, however, is an extraordinarily bad idea. Those numbers may look simple, but under the hood they have a lot going on. The number itself is automatically generated based on the reference number in the text itself. (If you create footnote number 9 in your document, the note itself will start with the number 9. If you delete footnote number 9 in your document, the note and its number will be deleted.) The number also signals the start of a new note, and if it’s gone, document corruption is probably not far behind.

You can often tell if a reference number is missing by looking at the other note numbers. If they’re numbered like this, you know something’s wrong:

A Clue That Something Is Wrong

A Clue That Something Is Wrong

That’s actually a fairly easy problem to fix: just copy the reference number from one of the other notes and paste it in front of the note that’s missing its number. For example, if you copy the number for note 3 and paste it in front of the numberless note 2, you’ll actually get a 2 in front of the note. Microsoft Word is smart enough to know what the number should be.

Usually, the reason a number is missing is because the author has directly deleted the entire text of the note, like this:

When Note Is Deleted Directly

When Note Text Is Deleted Directly

Why Microsoft hasn’t prevented this is beyond me. If the author had deleted the note number up in the main document text, there wouldn’t be a problem.

Typed-In Reference Numbers

Sometimes, in an effort to make notes look “pretty” or meet a certain style, authors will format reference numbers as regular text rather than superscript, then type a period after them. There’s really nothing wrong with that, other than introducing extraneous periods when importing the file into a typesetting program. But some authors actually delete the numbers and type in new ones by hand. You can tell when that has been done by putting your cursor in front of a double-digit note number and pressing the right cursor key. If your cursor moves past the entire number, the number has been automatically generated. But if your cursor moves forward only one digit, the number has been hand-typed.

Again, you could fix the problem by copying an automatic number and pasting it over the hand-typed number, but what if all of the numbers have been hand-typed? Where will you get an automatic number to copy? Simple: just insert a new footnote and copy the number from that. After you’ve finished pasting, delete the extra note (up in the text, remember).

If you have lots of these numbers, you probably won’t want to fix them by hand, so here’s an easier way:

  1. Select all of the notes in the notes pane.
  2. Copy the notes.
  3. Paste the notes at the end of the document.
  4. Using Word’s Find and Replace feature, search for ^f (the code for footnotes) or ^e (the code for endnotes) and replace all of the existing note numbers with a superscript 1. (That will also delete all of the automatic notes in the document.)
  5. Use the “Text to Notes” feature of my trusty NoteStripper add-in to turn the text notes into automatically numbering ones.

“Special” Carriage Returns

Sometimes when editing notes, you’ll try to make a deletion and get the message that “This is not a valid action for footnotes”:

Oops!

Oops!

What that cryptic message should say is “You can’t delete the carriage return that ends a footnote.” The carriage return that marks the end of a note isn’t a regular return; it’s a special return, and you can’t delete it — Word won’t let you. So what often happens is that authors will delete the note text and its reference number, leaving the carriage return behind. But there is a way to get rid of that return: delete its note number up in the main text of the document. If you can’t tell which note number that is, copy the number of a different note and paste it in front of the note’s carriage return. That will give the note a proper number, and you can then delete the note up in the main text. If you have lots of these extraneous carriage returns, you can get rid of them with a macro, as described in “Lyonizing Word: Deleting Extraneous Carriage Returns in Footnotes and Endnotes.”

Microsoft, Are You Listening?

We wouldn’t have such problems with notes if Microsoft would implement just a few changes:

  1. Make it possible to delete a note by selecting the entire note, including the note reference number, the note text, and the “special” carriage return at the end of the note, and then pressing the Delete or Backspace key (which should also remove the note number from the main text). That would keep authors from leaving behind misnumbered notes and extraneous carriage returns.
  2. Provide additional numbering options for the reference numbers in front of the note text, in particular the option to use full-sized numbers followed by a period. That would keep authors from typing in numbers and periods by hand (maybe).
  3. When trying to delete the reference number or carriage return, provide a message that says “Select the entire note before deleting” or “To remove a note, delete the note number in the main text of your document.”

These changes would do a lot to prevent problems caused by authors who don’t know how to properly use Word’s notes. You can help by letting Microsoft know about these needed changes. Give your feedback at Microsoft’s “Welcome to Word’s Suggestion Box!

What about you? Have you seen other odd problems with Word’s notes? If so, how have you solved them?

Jack Lyon (editor@editorium.com) owns and operates the Editorium, which provides macros and information to help editors and publishers do mundane tasks quickly and efficiently. He is the author of Microsoft Word for Publishing Professionals, Wildcard Cookbook for Microsoft Word, and of Macro Cookbook for Microsoft Word. Both books will help you learn more about macros and how to use them.

Can I Publish This Photograph of the Mona Lisa?

by Jack Lyon

In a departure from my usual technical stuff, I recently finished writing a rather specialized book on Christian symbolism, featuring numerous works of art from the Middle Ages and Renaissance. Rather than publish the book myself, I decided to go with a publishing company that has considerably more marketing mojo than I do. And of course, that’s when the fun started:

Dear Mr. Lyon:

We would respectfully request that you please send documentation of your permission(s) to use third-party images and a list of the images for which you feel permission is not required. The list should include (1) the title of the image, (2) where the image can be found online, and (3) why you feel permission is not required.

Here is my reply:

I respectfully decline your request as unnecessary. The images I’m using are in the public domain.

The publisher’s representative replied:

If I take a photograph of a Leonardo da Vinci painting, I own the copyright in that photo. And even though it’s a photo of a public-domain item, you still need my permission to use my photograph.

So what do you think? Is the publisher’s representative correct? After all, that’s the common understanding. But actually, it depends on the nature of the photograph. In both the United States and in Europe, a photo that is merely a reproduction of a public-domain work (such as an old painting or stained-glass window) is not protected by copyright. To quote the U.K.’s Intellectual Property Office (based on the opinion of the European Court of Justice), “Copyright can only subsist in subject matter that is original in the sense that it is the author’s own ‘intellectual creation’” (for more information, see Wikipedia).

In the United States, this issue was decided in the case of Bridgeman Art Library v. Corel Corp., in which the court ruled that exact photographic copies of public-domain images could not be protected by copyright in the United States because the copies lack originality (which, by the way, is the deciding factor).

So even if someone claims copyright in a photograph that reproduces a public-domain image, no permission is needed, because photos that are simply copies of public-domain works and lack any aspect of originality are themselves in the public domain. In fact, the more faithful the reproduction, the less originality there is. Wikipedia has an excellent example — a photograph of the Mona Lisa.

The whole point of that Wikipedia photo is to reproduce the Mona Lisa as accurately as possible — which is precisely to eliminate any elements of originality. In fact, dozens of such photos might exist, all indistinguishable from one another. And that’s why such reproductions are not protected by copyright.

Now, if you took a photo of the Mona Lisa that was not simply a copy of the painting but rather had its own original elements (such as special lighting or camera angle), that photo would not be in the public domain; you would indeed own the copyright in that photo. Here’s an example of an image that is not in the public domain: non–public domain Mona Lisa.

Here’s another version that would be under copyright because it includes original content: original content Mona Lisa.

Rich Adin raised an interesting question about this: Would a black-and-white photo of the Mona Lisa have enough originality to be protected by copyright? It would probably depend on how much originality the photo might be judged to have, and perhaps that would have to be settled in a court of law. Please note that just because you use someone’s highly accurate photograph of a public-domain image doesn’t mean the person can’t sue you for doing so, even though that person might lose the case.

Interestingly, contemporary photos of statues are always under copyright, as there’s no way to accurately reproduce a three-dimensional object in a two-dimensional photo. A photo of Rodin’s Thinker will always have elements of originality based on framing of the statue, lighting, focus, and so on.

But what about a photograph of artwork that’s not in the public domain? For example, consider the fine art of Carolyn Hutchings Edlund (who happens to be Rich Adin’s better half). If she takes a photograph of one of her paintings and posts it on her website, can I legally use that photograph as the cover image for my next book? No, I can’t. Why? Because Carolyn’s painting is not in the public domain. It’s her original creation, and she owns the copyright. Even though her photo of the painting may lack originality, her painting does not, and I’m not at liberty to use that image without her permission.

One question I haven’t addressed yet is how to know whether or not something is actually in the public domain — something that can be tricky to ascertain. In the United States, anything created before 1923 is generally fair game. In other countries, however, copyright terms may be more stringent, so care and caution are needed.

As the standard disclaimer goes, I am not a lawyer, and you should not consider this article as legal counsel in any way. Nevertheless, I hope that my experience with all of this might be useful to you in your own battles in the wonderful world of publishing.

Jack Lyon (editor@editorium.com) owns and operates the Editorium, which provides macros and information to help editors and publishers do mundane tasks quickly and efficiently. He is the author of Microsoft Word for Publishing Professionals, Wildcard Cookbook for Microsoft Word, and of Macro Cookbook for Microsoft Word. Both books will help you learn more about macros and how to use them.

Lyonizing Word: Using the “Find What Expression” Wildcard

by Jack Lyon

Rich Adin recently sent me an interesting challenge. He was using his EditTools Journal feature to mark journal titles in references. The power behind that useful tool comes from lists of incorrectly styled references with corresponding correctly styled references. He creates a separate list for each reference style. The list he sent me was for AMA style, in which the reference uses the PubMed abbreviation followed by a period. It looks like something like this:

A Gesamte Exp Med, | cyan -> Z Gesamte Exp Med.
A Gesamte Exp Med. | cyan -> Z Gesamte Exp Med.
A JR | cyan -> AJR Am J Roentgenol.
A M A Arch Ind Hyg Occup Med. | green
A of LTC | cyan -> Ann Longterm Care.
A of LTC, | cyan -> Ann Longterm Care.
A of LTC. | cyan -> Ann Longterm Care.
A&D | cyan -> Aging Dis.
A&D, | cyan -> Aging Dis.
A&D. | cyan -> Aging Dis.
A. M. A. Arch. Derm | cyan -> AMA Arch Derm.
A. M. A. Arch. Derm, | cyan -> AMA Arch Derm.
A. M. A. Arch. Derm. | cyan -> AMA Arch Derm.

The text to the left of the pipe (|) is how the entry might (incorrectly) appear in the references supplied by the author; the entry to the right is how it should appear. Each entry includes a color, either cyan or green, which tells the program to use that color in highlighting the reference.

Rich knew that some of the entries included duplicates, like this:

Arch Intern Med. | cyan -> Arch Intern Med.

In other words, the item on the left was identical to the item on the right, which meant that it shouldn’t be marked. That also meant the entry didn’t need to be on the list at all. But the real problem was that Rich’s reference list included more than 117,000 entries!

Rich’s challenge? Use wildcard find and replace to remove such entries, thus shortening the list and preventing unnecessary marking.

First, let’s look at that entry again to see what we might need to do:

Arch Intern Med. | cyan -> Arch Intern Med.

There’s a pipe symbol (|) in the middle, which gives us something to differentiate the left side of the entry from the right side of the entry. So we might set up the first part of our wildcard string to look like this:

([!^013]@) |

That tells Word to find any character except a carriage return, an unspecified number of times, until it comes to a space followed by a pipe symbol.

The wildcard for a carriage return is:

^013

The wildcard for “except” is:

!

And we have to put both of those in square brackets so Word knows that’s a set of characters. (After all, [!^013] finds any character, no matter what it is, unless it’s a carriage return.)

The wildcard for “an unspecified number of times” is:

@

Finally, we have to put all of that into a “group” by enclosing it with parentheses. And that’s important. You’ll see why in a minute.

Testing that part of our search string, we see that, yes, indeed, it finds the following:

Arch Intern Med. |

In fact, it finds the beginning of each entry, which is just what we want.

Now let’s look at the right side of our entry:

 cyan -> Arch Intern Med.

You can’t see it here, but there’s a space in front of “cyan” — the space that follows the pipe symbol. So we need to include that space in our search string, along with the word “cyan” (in the following examples, I use [space] to represent a space so you can see it; [space] should not actually be entered; use a real space created by pressing the space bar):

[space]cyan

There’s also a space after cyan, so we’ll need to include that as well.

[space]cyan[space]

That needs to be followed by a hyphen, a right angle bracket, and yet another space, like this:

[space]cyan[space]->[space]

But now you may be wondering why I put a backslash in front of the angle bracket. It’s because the angle bracket is itself a wildcard (a subject for another day), so we need to tell Word we’re using it as an actual character, which is what the backslash does.

Finally, the rest of our search string looks like this:

1^013

This part of the string —

1

— is the “Find What Expression” wildcard, which is what this article is about, and it certainly took us a long time to get to it!

Remember back when we grouped the very first part of our search string in parentheses?

([!^013]@)

That “group” is the “expression” that the 1 wildcard represents. In algebraic terms:

1 = ([!^013]@)

And that means 1 will find whatever is found by the ([!^013]@) expression, which, my friend, is extremely cool, because it will allow us to weed out the duplicate entries on our reference list—entries like this:

Arch Intern Med. | cyan -> Arch Intern Med.

Now, for the first time, let’s look at our entire search string:

([!^013]@) | cyan -> 1^013

By now, you probably understand this quite well. The string finds any characters except a carriage return until it comes to a space and a pipe symbol; then it finds a space, the word “cyan,” and another space, followed by a hyphen, a right angle bracket, and a space. Finally (and most importantly), it finds whatever was found by the parenthetical group, followed by a carriage return.

Now we simply need to make sure that Word’s “Replace with” box is empty and click “Replace All.” All of those unnecessary entries will be deleted. (We’ll need to repeat with “green” for the entries that don’t include “cyan.”)

Which would you rather do: Find and delete such entries manually (with just 117,000 to look through) or have Word do it automatically?

That’s the power of the “Find What Expression” wildcard. In future articles, I’ll show you more uses for this wonderful tool, along with other Word wildcards.

Jack Lyon (editor@editorium.com) owns and operates the Editorium, which provides macros and information to help editors and publishers do mundane tasks quickly and efficiently. He is the author of Microsoft Word for Publishing Professionals, Wildcard Cookbook for Microsoft Word, and of Macro Cookbook for Microsoft Word. Both books will help you learn more about macros and how to use them.

Lyonizing Word: But Which Styles?

by Jack Lyon

In my previous article, Lyonizing Word: Taming Styles in Microsoft Word, I explained how to make Microsoft Word display only the paragraph styles you want to use. But that raises an important question: Which paragraph styles do you want to use?

If you’re writing a simple business letter, the only style you may need is Word’s default of Normal. But if you’re editing a book, things immediately become much more complicated. Consider: What different kinds of text exist in a book? Let’s start with the title page; at a minimum, it includes the following elements:

  • Title
  • Author
  • Publisher

It may also include these:

  • Subtitle
  • Publication date

And that means you’ll probably need a paragraph style for each one of those. Why? Because the designer may want to format each element differently. Even if that ends up not being the case, you’ve at least allowed for the possibility. In addition, using a different style for each element makes it possible to use those elements as metadata, and that can be important in electronic publishing. Back in the late 1990s, I was involved in the production of an enormous electronic library. Most of the books were already styled with—that’s right—Title, Author, and Publisher, making it fairly easy to access those elements through a database and thus allow the user to sort books by title, author, and so on.

What styles will you need as you get into the book’s chapters? You might want to pull a couple of books off your shelves and see. You’ll probably find that you’ll need (at a minimum):

  • Chapter number
  • Chapter title
  • Body text

And as you get deeper into the book, you may need some of the following:

  • Block quotation
  • Poetry
  • Subheading
  • Subsubheading

Most books include a multitude of other elements, such as:

  • Dedication
  • Epigraph
  • Caption
  • Notes
  • Bibliography

And on and on and on.

Do you really need all of this detail? Yes, you do. Even if epigraphs and captions are going to look the same (e.g., both will use left-justified 10-point New Century Schoolbook), you as an editor, working in an editorial capacity, shouldn’t be thinking about how epigraphs and captions will look; you should be thinking about whether a specific bit of text is an epigraph or a caption and applying the metadata (a style) that marks it as such. Otherwise, the designer and typesetter won’t know for sure which text they need to format in a certain way. In addition, applying the proper metadata (styles) to epigraphs and captions makes them accessible and manipulable in various ways for later electronic publishing.

Can’t you just let the designer or typesetter take care of all this styling? No, you can’t. Deciding what text should be marked with which style is an editorial matter, not a design or typesetting one. Is this bit of text a subheading or a subsubheading? Should that bit of text be run in or pulled out as a block quotation? Is this line really an epigraph or just part of the body text? Is that line a chapter title, or should it be relegated to a subheading? All of these are editorial decisions; they have to do with what the text is and with what the text means.

Design decisions, on the other hand, have to do with how the text looks. The editor has styled this line as an epigraph. Should it be set in Comic Sans? (Horrors!) Should it be set in italics? Should it be a smaller point size than body text? Should it be centered?

So what styles do you really need? It depends on the book. And there’s no way to know without actually going through the book to find out. I tend to do this as I work, creating new styles as the need arises. Hey, that’s a poem! Guess I’ll need a poetry style (which I then create and apply).

And what should my poetry style look like? For editorial purposes, it doesn’t matter, as long as I can tell that the poetry style has been applied. For example, I might set up the style to be indented half an inch on both sides, with the text color set to blue. When the designer and typesetter bring the text into InDesign, they can redefine the style any way they like. But for now, I can tell that I’ve styled that text as poetry, which, for me as an editor, is all that matters.

In this article, I’ve assumed that you’re creating the styles you need to use, as that’s how I usually work. But for the most part, editors who work for publishers don’t need to do that. Publishers often have their own sets of styles that they require editors to use, and these styles are usually stored in a Word template. For example, you can download the Springer template and the Wiley template. Both templates are well worth looking at, just so you can get an idea of what publishers are looking for in the way of styled manuscripts. Wiley provides additional information in an online article "Applying Formatting Styles."

You may also be interested in my Author Tools Template, which is a collection of styles that make it easy for authors (and editors) to produce properly styled manuscripts, which means that publishers can then use those manuscripts without having to restyle the text.

In addition, if you’re working with styles as I’ve explained in this article, you owe it to yourself to check out the Style Inserter in Rich Adin’s EditTools. This is a slick feature that overcomes the problems with styles that I discussed in my previous article (see Lyonizing Word: Taming Styles in Microsoft Word) and makes it easy to apply publisher styles to a manuscript.

bodytextIt’s worth noting that some publishers don’t use styles at all. Instead, they require editors to mark up text with publisher-supplied codes like the one at the beginning of this paragraph. In that case, it’s important not to type the codes in by hand, as doing so can easily lead to errors. Instead, editors should use something like Code Inserter, which is included in EditTools.

In the 1980s, I worked on the Penta system, which used such codes extensively. During the 1990s, however, I switched to WordPerfect 6.0 and finally to Microsoft Word, and marking text with styles became a more intuitive way to work.

So what styles do I routinely use today? Here’s the minimal list, which I use in all of the books I publish at Waking Lion Press:

  • Half-Title
  • Title
  • Subtitle
  • Author
  • Publisher
  • Copyright
  • Dedication
  • Epigraph
  • Epigraph Source
  • Part
  • Chapter
  • Section
  • Subsection
  • Block quote
  • Poem
  • Poem Heading
  • Poem Source
  • Bibliography
  • Notes

How about you? What styles do you routinely use? And do you have any tips on how to use them? If so, I’d love to hear from you.

Jack Lyon (editor@editorium.com) owns and operates the Editorium, which provides macros and information to help editors and publishers do mundane tasks quickly and efficiently. He is the author of Microsoft Word for Publishing Professionals, Wildcard Cookbook for Microsoft Word, and of Macro Cookbook for Microsoft Word. Both books will help you learn more about macros and how to use them.

Lyonizing Word: Taming Styles in Microsoft Word

by Jack Lyon

Microsoft Word includes a powerful feature for marking the various levels of a manuscript (such as headings, block quotations, poetry, and so on). That feature is styles, which are valuable for many reasons, including:

  • They make it possible to reformat a whole document simply by redefining styles or applying a different template using those styles.
  • They make it possible to find and replace only text using a certain style. For example, you might want to find source citations by searching for parentheses in text styled as block quotations.
  • They make it possible to generate a table of contents based on specified styles.

So styles are very useful. The problem is that Microsoft Word, in its usual “helpful” way, tries to manage which styles are available, in which document, and how those styles can be accessed. Finally growing tired of this nonsense, I decided to take the matter firmly in hand by writing this article.

My first gripe is that Word decides which styles to show in the Styles area of the Home ribbon, which decision seems to be based on nothing that makes any sense. Right now, it’s showing the following:

Quick Style Gallery

Quick Style Gallery

Of the styles available, I use Normal and Heading 1. But Strong? Subtle Emphasis? Intense Emphasis? Who makes this stuff up? Not an actual writer or editor, that’s for sure. So the first thing to do is get rid of the icons for the styles I never use:

  1. Right-click the icon (such as that for Strong).
  2. Click “Remove from Quick Style Gallery” (which, evidently is what the Styles area is called).

Remove from Quick Style Gallery

Remove from Quick Style Gallery

Now, the question is, when I restart Word or create a new document, does the Strong icon come back? Let’s find out. (Now restarting Word.)

Ha! It’s gone! But what happens if I create a new document? (Now creating a new document.)

Shoot, Strong is back again. So we can conclude that removing a style from the Quick Style Gallery applies only to the document in which we remove the style.

I could get rid of Strong and then save what I’ve done as a Quick Style Set:

Save as Quick Style Set

Save as Quick Style Set

But I’d like to get rid of Strong once and for all. How can I do that?

Well, I’ll start by showing Word’s task pane (by clicking the little arrow at the bottom right of the Styles area):

Word's task pane

Word's task pane

Now I should be able to click the drop-down arrow next to Strong and delete it, right? Nope. Word won’t let me. How annoying!

Delete Strong

Delete Strong

Well, then, where does the Strong style live? In Word’s Normal.dotm template, of course. Can I get rid of it there? I open the folder where the template lives, which on my computer is here:

C:UsersJackAppDataRoamingMicrosoftTemplates

Then I open the Normal.dotm template. Now can I delete the Strong style?

No, I can’t; same problem as before. Word really, really, really wants to keep its built-in styles — which is why they’re called “built-in,” I guess. So my only recourse is to (1) set how the style will be displayed and then (2) tell Word which styles to display. Here’s how:

  1. Open the Normal.dotm template, which is where your default styles are stored.
  2. Under Style Pane Options (the blue “Options” link at the bottom of the task pane), set “Styles to Show” as “Recommended.” Select “New documents based on this template.”

Show styles as recommended

Show styles as recommended

  1. Under Manage Styles (the third button at the bottom of the task pane), set all styles to “Hide” or “Hide until used” except those you want to show. (Even now, Word won’t let you hide everything.) Select “New documents based on this template.”

Hide Strong

Hide Strong

  1. Make any other adjustments you’d like, such as the order in which the styles will appear in the task pane.
  2. Save and close the Normal.dotm template.

After you’ve done that, every time you start Word or create a new document, you’ll get only the styles you want to see. I think. I hope. Maybe.

How about you? Do you have any helpful hints about how to tame Word’s styles? If so, I’d love to hear from you.

Jack Lyon (editor@editorium.com) owns and operates the Editorium, which provides macros and information to help editors and publishers do mundane tasks quickly and efficiently. He is the author of Microsoft Word for Publishing Professionals, Wildcard Cookbook for Microsoft Word, and of Macro Cookbook for Microsoft Word. Both books will help you learn more about macros and how to use them.