Resetting Shakespeare

By Jack Lyon, the Editorium

In late 2020, I discovered the down-to-the-character transcription of Shakespeare's First Folio created by PlayShakespeare.com. I knew immediately what I had to do: Reset the type in a new edition of the book—something that's never been done since the 1600s. With the encouragement and advice of Ron Severdia, founder and editor-in-chief of PlayShakespeare.com, I downloaded the files and began preparing them for typesetting.

I soon realized that this project would be much harder than I originally thought. The Microsoft Word documents available from PlayShakespeare.com had no formatting at all—just plain old text. Immaculately coded XML files were also available, but the proprietary files needed to transform them into formatted text were not. Ultimately, I downloaded the nicely formatted HTML versions of the individual plays, but making them match the layout of the original still required much wildcard manipulation and styling in Word.

I briefly considered typesetting the pages in Word itself, but I've learned from long experience that Word will fight you every step of the way in long, complex projects like this one. Even Adobe InDesign wasn't as cooperative as I'd hoped. Finally, I went with Affinity Publisher, which still has a few bugs but did exactly what I needed. You can download a sample of the finished pages here.

What made this interesting to me as an editor was seeing the editorial and typographical style used by the compositors of the book in Shakespeare's era. Here are just a few examples:

  • The first letter (and only the first letter) after a drop cap is capitalized. Modern practice is to set the rest of the word (and probably a couple of the following words) in small caps.
  • The typographical weight of important words doesn't seem to matter much. For example, there's "THE TRAGEDIE OF Othello, the Moor of Venice." Modern practice might be to set the title as "The Tragedie of OTHELLO, the Moor of Venice."
  • Page numbers start again at 1 for each section of the book: "Comedies" starts on page 1. "Histories" starts again on page 1. Finally, "Tragedies" starts on a new page 1. Modern practice is to use continuous page numbers throughout the book.
  • Running heads vary in typography and layout, and they may or may not match the title of the play. This may be because certain type styles in certain sizes were limited (metal type, remember), so if typesetters ran out of one style, they'd simply use another, even within the same play.
  • Modern practice is to be consistent in formatting; if one subheading is set in 10-point Helvetica, all subheads should be set in 10-point Helvetica. Not so in the First Folio. If you look through the original front matter, you'll see some regular text in 14 points, other regular text in 10.5 points. Poetry is all over the place, some almost too small to read. Once we get into the plays themselves, formatting is more regular, so perhaps the front matter was thought of as display type.
  • The letters I and J, along with U and V, are usually interchangable: "Ben Ionson." "If Musicke be the food of Loue, play on."
  • Abbreviations abound, not just to indicate the name of a player but also to fit words into a line that's too long for the measure. For example, "them" might be set as "thē"; "thou" is sometimes abbreviated as "ÿ."
  • Other lines that are too long are set with the final words above or below the rest of the line, following an opening parenthesis.
  • Ligatures are plentiful, evidently just to look fancy.
  • Dashes are long and extra dark.
  • Spelling and capitalization vary, probably depending on who is setting type at the time. For example, sometimes we have "Scena Secunda," at other times "Scœna secunda." And what's with all that Latin?
  • Sometimes nouns are capitalized for no apparent reason: "That our Garments being (as they were) drencht in the Sea." I wonder if this might be a cue to the actor about how to speak the line.
  • Colons, semicolons, question marks, and exclamation marks are usually preceded by a space. Other punctuation is not.
  • Apostrophes are used to mark words that should be spoken as one syllable rather than two: "Fetch me that flow’r."

I'm sure I've missed some things, but I seem to have misplaced my copy of The Jacobean Manual of Style. I wonder if William and Isaac Jaggard (who printed the First Folio) had such a thing. If they did, they didn't hesitate to depart from it when circumstances demanded it. In that, we moderns may not be so different.

If you're interested in buying a copy of Shakespeare's First Folio Ultimate, you can do so at Amazon, Barnes & Noble, and other purveyors of fine literature.

Inserting Unlinked Sections

Microsoft Word, in a broad sense, has two kinds of section breaks (Insert > Break):

1. The kind that starts a new page (which includes "Next page," "Even page," and "Odd page).

2. The kind that doesn't (which includes "Continuous").

The second kind is used for such things as inserting two-column text in the middle of single-column text, all on the same page, and it doesn't concern us here. What does concern us, as editors, is what happens when we insert a section break that starts a new page.

By default, Word "links" the headers and footers in the new section to those in the previous section. In other words, when you insert a section break, the headers and footers will be the same in the new section as in the old. If you're working as an office assistant and spend most of your time formatting letters and reports, that might be exactly what you want. But if you're a book editor, that's probably *not* what you want. You probably want each section (that is, each *chapter*) to have its own headers and footers, as explained here:

http://lists.topica.com/lists/editorium/read/message.html?mid=1716147063

In a previous newsletter, I supplied a macro that would unlink *all* headers and footers en masse:

http://lists.topica.com/lists/editorium/read/message.html?mid=1712666067

But wouldn't it be nice to be able to simply insert an *unlinked* section break? Here's a macro that will do just that, using a break of the "Next page" variety:


Sub InsertUnlinkedNextpageSection()
Selection.InsertBreak Type:=wdSectionBreakNextPage
If ActiveWindow.View.SplitSpecial <> wdPaneNone Then
ActiveWindow.Panes(2).Close
End If
If ActiveWindow.ActivePane.View.Type = wdNormalView Or ActiveWindow. _
ActivePane.View.Type = wdOutlineView Then
ActiveWindow.ActivePane.View.Type = wdPrintView
End If
ActiveWindow.ActivePane.View.SeekView = wdSeekCurrentPageHeader
Selection.HeaderFooter.LinkToPrevious = Not Selection.HeaderFooter. _
LinkToPrevious
ActiveWindow.ActivePane.View.SeekView = wdSeekMainDocument
End Sub

If you don't know how to use such macros, you can learn how here:

http://lists.topica.com/lists/editorium/read/message.html?mid=1706922855

Note that you can modify the macro to insert an odd-page section break or an even-page section break, which are particularly useful in book publishing. To do so, change this line--

Selection.InsertBreak Type:=wdSectionBreakNextPage

to this--

Selection.InsertBreak Type:=wdSectionBreakOddPage

or this--

Selection.InsertBreak Type:=wdSectionBreakEvenPage

For ease of use, I'd recommend that you hook the macro to a menu, toolbar button, or keyboard shortcut, as explained here:

http://lists.topica.com/lists/editorium/read/message.html?mid=1707444986

http://lists.topica.com/lists/editorium/read/message.html?mid=1707286867

http://lists.topica.com/lists/editorium/read/message.html?mid=1713088939

Then, the next time you insert a section break (using the macro, of course), it will be automatically unlinked!

_________________________________________

RESOURCES

You'll find a very nice tutorial on sections, section breaks, and headers and footers at Charles Kenyon's AddBalance website:

http://www.addbalance.com/usersguide/sections.htm

Especially if you work in a legal environment, you'll find many other useful resources at the site. Don't overlook the complete, downloadable User Guide (scroll down to the bottom of the page):

http://www.addbalance.com/usersguide/

Section Breaks

In this final installment of how to set up book pages for publishing, we look at section breaks in Microsoft Word. Section breaks let you do a number of things. The most important ones for our purposes are:

* Restart page numbers from section to section--between front matter and chapters, for example.

* Restart footnote and endnote numbers from chapter to chapter.

* Use different running heads from chapter to chapter.

Let's say you've got your whole book in one giant document. (Yes, that's how I like to work.) You'll now want to separate your chapters with section breaks. To do so:

1. If you're not already in Normal view, switch to it by clicking View > Normal. That will allow you to see the breaks you're going to insert.

2. Go to the first place you want to change page numbers, note numbers, and running heads. The first page of your preface will do nicely.

3. At the top of the page, insert a section break by clicking Insert > Break > Page break. Under "Section break types," select "Odd page" (if you want to go the traditional book-publishing route). Then click the OK button. Your document should now include a double-lined section break labeled "Section Break (Odd Page)." If you already had a manual page break there, get rid of it.

4. Repeat steps 2 and 3 at the beginning of each chapter in your book.

Microsoft, catering as usual to office workers rather than publishing professionals, has set up Word by default to have headers and footers from section to section be the "same as previous." That is, if you set up running heads in one section, they'll automatically continue into the next. But in book publishing you don't want them to be the same; you want them to be different.

One way to "unlink" them is to go to your second section (your preface, for example), click View > Header and Footer, and then click the "Same as previous" button on the Header and Footer toolbar. (To see which button is which, rest your mouse cursor over each button for a few seconds until the ToolTip appears.) Then repeat the procedure for each chapter (section) of your book. Failure to unlink headers and footers will eventually drive you mad.

An easier way to unlink them is to use the following macro:


Sub UnlinkHeadersFooters()
Dim curSection As Section, curHeader As HeaderFooter
For Each curSection In ActiveDocument.Sections
For Each curHeader In curSection.Headers
curHeader.LinkToPrevious = False
Next curHeader
For Each curHeader In curSection.Footers
curHeader.LinkToPrevious = False
Next curHeader
Next curSection
End Sub

If you don't know how to use macros like that one, you can find out here:

Once the headers and footers are unlinked, you can set up headers, footers, and page numbers for *each section* as explained in last week's newsletter:

http://lists.topica.com/lists/editorium/read/message.html?mid=1716147063

While you've got the Header and Footer toolbar available, click the Format Page Number button and tell Word how you want page numbering to work from the previous section to the current one. You can "continue from previous section" or "start at" a number you specify. If you're going from front matter (with Roman numerals) to a chapter (with Arabic numerals), "Start at" is the option you want to use.

And now for those note numbers:

1. Click Insert > Reference (in Word 2002+) > Footnote.

2. Select Footnote or Endnote as appropriate.

3. Click the Options button.

If you're using Footnotes, you can set note numbers to:

* Continuous (throughout the book--not recommended).

* Restart each section (chapter, that is; the traditional method).

* Restart each page (unorthodox but elegant).

With endnotes, only the first two options are available, since endnotes don't appear on each page.

Click the OK button to finish up.

_________________________________________

READERS WRITE

Hilary Powers wrote:

Jack, is there any chance of making MegaReplacer see Language settings? My latest oops is in U.K. English throughout, and it'd have been pleasant to use MegaReplace to fix the files in one fell swoop. (It turns out that AutoCorrect, where a lot of my shortcuts lurk, is language specific, so I can't wait to find out--but it'd be a real enhancement.)

I replied:

How about using MultiMacro to do this? You could record a macro that:

1. Selects all.

2. Sets language.

Then have MultiMacro run the macro on your files.

Hilary responded:

I recorded the guts of the macro you described, which turned out to be

Selection.WholeStory

Selection.LanguageID = wdEnglishUS

Application.CheckLanguage = True

and stuffed it into the macros I use (via MultiMacro, of course) to set the working template for each job at the beginning. Hey presto! No more need to think about language settings.

You can learn about MultiMacro here:

http://www.editorium.com/14844.htm

____________________

John Eagleson wrote:

I'm trying to do something that is a kind of variation on MegaReplacer, but I haven't yet found a tool in your arsenal that does it.

A simple example:

I want to search for "January," and when I find it I want to be able to stop and edit the term. Depending on the context I may want to precede it with a nonbreaking space (20 January), follow it with a nonbreaking space (January 21), abbreviate it, or leave it as is.

When I'm finished I want to then hit or some other key and find the next instance of January.

When I'm finished January I want the macro to do the same with February, and so on.

Do you have a way to do that?

I replied:

My Go2Text macro will kind of do what you need. You can use it to find a word, such as "January," and all succeeding instances of "January," but once the word was found, you'd have to make the changes manually.

http://www.editorium.com/freebies.htm (scroll to the bottom of the page)

You wrote that the replacement would depend on the context, so one way to approach the problem is to figure out what the context is in each case. For example, one context would be "January" preceded by a space and one or more numbers. Another context would be "January" *followed* by a space and one or more numbers. In wildcard terms:

Find what:

([0-9]{1,2}) (January)

Replace with:

12

Find what:

(January) ([0-9]{1,2})

Replace with:

12

And so on.

Then, once you've identified the various contexts, it's a fairly simple matter to set up the wildcard Find and Replace strings (with February, March, etc.) to feed to MegaReplacer.

If you need more information on Find and Replace with wildcards, see the paper on advanced searching that came with MegaReplacer.

You might also be able to use some of the information in this newsletter article:

John responded:

I think I found one way to do what I'm trying to do with MegaReplacer.

In my example of finding all the names of the months and pausing at each one to allow editing, I want to be able to do this without typing the names of the months each time.

Solution:

1. Set up a file with the names of the months:

January|

February| etc.

Only the pipe is needed here since I'm not going to be replacing anything yet.

2. Run MegaReplacer with Mark Automatically checked. Now all the months are marked.

3. Search for the CheckMe character style. I use CTRL+PGDN to move from one month to the next (aka BrowseNext).

Thanks for your help--and your marvelous macros!

You can learn about MegaReplacer here:

http://www.editorium.com/14843.htm

Many thanks to Hilary and John for their questions and solutions.

_________________________________________

RESOURCES

Steve Hudson's Indexing Add-in

Steve Hudson, the Word Heretic, has done it again, with his Indexing add-in for Microsoft Word. If you're creating indexes in Word, you know about the difficulties involved, not the least of which is having no way to jump from an index entry to the text it refers to. Steve has solved that problem; his indexing add-in creates *clickable hyperlinks* from index page numbers to the pages they refer to. Click on a hyperlink and jump to the text, where you can tweak and twiddle to your heart's content. In addition, the add-in highlights the entries so you can actually *see* them for a change. You can learn more--and download the complete documentation--here:

http://www.geocities.com/word_heretic/Indexing.html

If you're indexing in Word, you owe it to yourself to try this excellent new product. And while you're at Steve's website, be sure to check out his other products (notably his books) and his programming and writing services:

http://www.wordheretic.com

Setting Up Headers and Footers

After you've set up the pages of your book (as explained in the last newsletter), you'll need to set up headers and footers. Using Microsoft Word, you might think you'd find headers and footers under the Insert menu. Not so; they're under View. Why? Because your document *already* includes headers and footers. Every Word document does. But they're empty until you put something in them. Here's how:

1. Click View > Header and Footer. You'll now find your cursor in the Header pane, with a nice little toolbar that lets you do various things:

Insert Page Number

Insert Number of Pages (so you can create a footer like "Page 7 of 123")

Format Page Number (1, 2, 3; a, b, c; i, ii, iii; and so on. Include chapter number [1-1; 1-A]. Continue from previous section [neat!] or specify a starting number.)

Insert Date (useful for creating slug lines)

Insert Time (ditto)

[Activate] Page Setup (handy!)

Show/Hide Document Text (to keep things uncluttered while creating headers and footers)

[Set header and footer to] Same as Previous [section] (in case you're using columns, for example, in one of your chapters; I almost always turn this off)

Switch between Header and Footer

Show Previous [header or footer]

Show Next [header or footer]

Close Header and Footer

2. Skip the header of your first page (labeled "First Page Header"), which will be the opening page of your chapter and thus doesn't need a running head. To do so, click the button to Switch between Header and Footer.

3. You're now in the footer (labeled "First Page Footer") of your chapter's opening page. Do you want a page number? I do. To get one, click the Insert Page Number button. (If this were front matter, you could click the Format Page Number button and set your numbering to use Roman numerals.) I *don't* recommend creating a page number with Insert > Page Number, because it puts the page number into a frame.

4. Decide whether you want the page number on the left, center, or right of your page and make it so. The easiest way to do this--and the most heretical, since it doesn't use styles--is to click Format > Paragraph > Alignment and pick your pleasure.

5. Move to the next page by clicking the Show Next button. This will take you to the next page's footer (labeled "Even Page Footer"). Since we previously set up our document to have different first, left, and right pages, you'll need to insert another page number here; it won't just continue the numbering from the first page. Again, format the number as left, center, or right. Since this is an even (and therefore left, or verso) page, you may want to put the page number on the left.

6. Repeat step 5 for the footer on the next page, which will be a right-hand (recto) page. You may want to put the page number on the right.

7. Move to the previous page's header (verso; labeled "Even Page Header") by clicking the Show Previous button and then the button to Switch between Header and Footer. Type the text of your header into the Header pane. In book publishing, items that are more inclusive go on the left; items that are less inclusive go on the right. A few options:

LEFT RIGHT

Author Name Book Title

Author Name Part Title

Author Name Chapter Title

Book Title Part Title

Book Title Chapter Title

Part Title Chapter Title

8. Again, the easiest way to put the running head on the left, center, or right of the page is to click Format > Paragraph > Alignment. Since this is an even page, you may want to put the running head on the left.

9. Move to the next page's header (recto) by clicking the Show Next button. Type the text of your header into the Header pane. Since this is an odd page, you may want to put the running head on the right.

10. Set the font and point size for your running heads and page numbers by modifying their styles under Format > Style. You want them to match the rest of your text, right? While you're in there, make sure they're not set up with an automatic first-line paragraph indent, which will make them look funny on the page.

11. Adjust the space between headers, text blocks, and footers by clicking the Page Setup button and the Margins tab. Then set the distance "From edge" (of the paper) of the header and footer. This may take some experimentation to get right, but when you're finished, your pages should look pretty good.

12. Click the Close button to get back to your document text.

To see your handiwork, click View > Print Layout and set View > Zoom to Whole Page. Wow! (Note that your folios [page numbers] and running heads are automatically repeated on successive pages.)

You'll need to repeat this whole procedure for each succeeding chapter, and if all of your chapters are in one document, you'll need to separate them with section breaks. More on that next week.

_________________________________________

READERS WRITE

Thomas C Dixon wrote:

I edited a book recently that showed two book pages per screen, with the pages numbered consecutively. I've read your article on page sizing, etc., but can't get this effect. How is it achieved?

I responded:

You can achieve what you're describing like this:

1. Click File > Page Setup.

2. Click the Margins tab.

3. Set Orientation to Landscape.

4. Set your document (under Multiple Pages in Word 2002) to 2 pages per sheet.

5. Apply to whole document.

6. Click OK.

7. Click View > Zoom.

8. Click Many Pages.

9. Select two pages.

10. Click OK.

Thanks, Thomas!

Dan A. Wilson wrote:

I think your position is the right one: it isn't a matter of TELLING people HOW TO ADJUST, but of REMINDING them TO REMEMBER to resize or zoom, or both. I, too, have seen countless cases of tennis-match-spectator neck syndrome caused by the use of a newly purchased monitor at full display max. Especially now that LCDs are so widely in use, it's important that users learn to adjust window sizes.

Almost all of my programs except Word and my browsers now run in windows that show my desktop wallpaper behind them on all four sides, because running them any larger than that on a 19" LCD is just plain silly unless you're viewing them from across the room. In Word, I either run single document pages at 80 to 90 percent zoom, or side-by-side pages at 75 percent, and the displays of the latter are STILL larger than those of pages at maximized display and 100 percent zoom on my 17" CRT on the other desk.

The advantage of the larger monitors today is that you can display MORE; using them to display the same old stuff LARGER is pointless for most programs, and an invitation to whiplash injuries.

Large LCD monitors have very high native resolution settings, and are optimized for those settings. Running a 17" LCD monitor at a resolution of 800 x 600 is not only bad for the monitor but bad for the eyes: even the best image available at that resolution on such a monitor will be fuzzy.

I use a 19" LCD with Word windows maximized but with my zoom set to 90% normally. Gives me a slightly larger-than-lifesize view of the page.

Most of the time, though, I use the taskbar right-click control to Tile Windows Vertically, so that I can have two different docs or two different views (or versions) of the same doc open side-by-side, each with its own toolbar. I set the zoom for each doc to 75% then, and the page on the screen is still about the size of an 8.5 x 11 sheet. This is great when I want to check text against the Biblio for presence and identity of entry info, for instance.

Thanks, Dan!

_________________________________________

RESOURCES

Ed Millis wrote:

Your readers might be interested in the Google search add-in. Previously only for Excel, it has now been updated in a Word form also [for both PC and Mac]. This is great! I use it all the time to search newsgroups for information on specific issues. You can find it here:

http://www.rondebruin.nl/Google.htm.

I must confess that, although I've used Word for many years, I've never really used all it could do. I love macros for doing repetitive stuff, but templates? styles? and all the other neat things? Never touched them. Your newsletter, I believe, is going to help me tremendously! Again, many thanks.

Thanks, Ed!

Setting Up Book Pages

Two weeks ago, I explained how to calculate page margins when typesetting a book in Microsoft Word. I neglected, however, to explain how to set up the pages themselves. So here goes.

For most books, you'll need three different page layouts:

1. The first page of a chapter.

2. A left (verso) page.

3. A right (recto) page.

Dedicated typesetting programs allow you to set these up using "master pages" that act as page templates. Word lacks such a feature but still makes it possible to set up different kinds of pages. Here's how:

1. Create a new Word document.

2. Click File > Page Setup. On a Macintosh, click the "Margins" button.

3. Click the Layout tab. Notice that the preview shows only one page.

4. Under "Section start," select "Odd page" if you want every chapter to start on the traditional odd page, or "New page" if you want to let the chapters fall where they may. Yes, you can start chapters on even pages if you insist.

5. Under "Headers and footers," put a checkmark in the boxes labeled "Different odd and even" and "Different first page." The preview now shows two pages. Hey, this is starting to look like a page layout!

6. Go back to the Margins tab.

7. Notice that you can set margin size for top and bottom, left and right. In Word 2002 or later, under "Pages," select "Mirror margins" from the dropdown list. In Word 97, 98, 2000, or 2001, put a checkmark in the box labeled "Mirror margins." Notice that "Left" and "Right" have become "Inside" and "Outside."

8. Set the margins for your pages. (You can include extra for the gutter if your printing needs require it, but I try to avoid this.) See the newsletter for January 28 for more information:

http://www.topica.com/lists/editorium/read/message.html?mid=1715887587

9. Click the OK button to put your decisions into effect.

Next week: Setting up headers and footers.

_________________________________________

READERS WRITE

I had some interesting responses to last week's feature article, "Size and Zoom." Some readers misunderstood (or I miscommunicated). The point of the article wasn't "Here's how to size your Word window." It was "Hey! Size your Word window!" Apologies to those who thought the article was too elementary.

And many thanks to Dan A. Wilson and Eric Fletcher, expert editors both, who sent the following useful messages:

Dan wrote:

I think your position is the right one: it isn't a matter of TELLING people HOW TO ADJUST, but of REMINDING them TO REMEMBER to resize or zoom, or both. I, too, have seen countless cases of tennis-match-spectator neck syndrome caused by the use of a newly purchased monitor at full display max. Especially now that LCDs are so widely in use, it's important that users learn to adjust window sizes.

Almost all of my programs except Word and my browsers now run in windows that show my desktop wallpaper behind them on all four sides, because running them any larger than that on a 19" LCD is just plain silly unless you're viewing them from across the room. In Word, I either run single document pages at 80 to 90 percent zoom, or side-by-side pages at 75 percent, and the displays of the latter are STILL larger than those of pages at maximized display and 100 percent zoom on my 17" CRT on the other desk.

The advantage of the larger monitors today is that you can display MORE; using them to display the same old stuff LARGER is pointless for most programs, and an invitation to whiplash injuries.

Large LCD monitors have very high native resolution settings, and are optimized for those settings. Running a 17" LCD monitor at a resolution of 800 x 600 is not only bad for the monitor but bad for the eyes: even the best image available at that resolution on such a monitor will be fuzzy.

I use a 19" LCD with Word windows maximized but with my zoom set to 90% normally. Gives me a slightly larger-than-lifesize view of the page.

Most of the time, though, I use the taskbar right-click control to Tile Windows Vertically, so that I can have two different docs or two different views (or versions) of the same doc open side-by-side, each with its own toolbar. I set the zoom for each doc to 75% then, and the page on the screen is still about the size of an 8.5 x 11 sheet. This is great when I want to check text against the Biblio for presence and identity of entry info, for instance.

Eric Fletcher wrote:

One of my favorite techniques to pass on to friends with similarly-aging eyes is to use the mouse scroll wheel to zoom in and out. Most Microsoft applications will zoom in and out when the CTRL button is pressed while rolling the wheel. I'm not sure what the Mac equivalent is but I assume there would be one.

In "Normal" view with wrap to window on, zooming in increases the font size and wraps the text in whatever size window you have available. Zooming to a much larger size temporarily is great to be able to differentiate between accents or footnote numbers.

In "Print" view, zooming way out to 10% lets you see thumbnails of many pages at once (15x7 with my current monitor settings). You can't read anything of course, but you can get somewhere within the document very quickly if you recognize the structure (say for chapter starts, tables of contents, tables, or graphics): just click in the page and zoom in.

The other use I've found for this capability recently is in conjunction with the "find all" capability for either Find or select all in the style task pane. When elements are selected--and therefore highlighted--many will be off the current screen. If you zoom out, you can see more selections at once. Of course, clicking to be able to zoom in removes the selection highlight but since both the Find dialog and task pane are modal, it is easy to reinstate the highlights to see any local to where you clicked.

_________________________________________

RESOURCES

Want to more about book design and page layout? Here are a couple of good places to get started:

Jacci Howard Bear's graphic design tutorials and procedures:

http://desktoppub.about.com/cs/graphicdesign/

John Magnik's typography and page layout tutorials:

http://www.typography-1st.com/typo/txt-lay.htm

Calculating Page Margins

In past newsletters, I've sometimes discussed aspects of typesetting in Microsoft Word:

http://www.topica.com/lists/editorium/read/message.html?mid=1708754845

http://www.topica.com/lists/editorium/read/message.html?mid=1708956278

http://www.topica.com/lists/editorium/read/message.html?mid=1711888513

http://www.topica.com/lists/editorium/read/message.html?mid=1711932079

http://www.topica.com/lists/editorium/read/message.html?mid=1712050217

One item I haven't addressed is how to calculate page margins *for publication,* which is not the same as just clicking File > Page Setup and putting in some numbers. If you needed to set up page margins for a printed book, for example, you'd need to set your margins to accommodate the size of your page and the text block on your page. This is not the same as *paper* size, which we'll say is 8.5 by 11 inches, a U.S. standard.

Let's say you want your *page* size (the size of your book) to be 7.5 by 9 inches. Here's what you'd do:

1. Calculate the margins you'd need to set to obtain the trimmed page size. (If you were using crop marks, this would be the measurement between them and the edge of the paper.)

- For the side margins, 8.5 minus 7.5 equals 1 inch. Divide that inch in half because, by golly, you have two side margins. That gives you a margin of .5 inch on each side of the page.

- For the top and bottom margins, 11 minus 9 equals 2 inches, divided in half equals a 1-inch margin for top and bottom.

2. Calculate the space from the edges of your *text block* to the top, bottom, and sides of your page. For convenience, let's say you're going to have 1 inch all around, but you could make those measurements anything you wanted. If it's 1 inch, then you'd just add 1 inch to the side margins, making 1.5 inches for each, and 1 inch to the top and bottom margins, making 2 inches for each.

4. Finally, click File > Page Setup and set your margins according to your calculations: side margins should be 1.5 inches, and top and bottom margins should be 2 inches.

Now when you type in your text, you'll get a text block of, let's see, 8.5 - (1.5 + 1.5) = 5.5 inches wide, and 11 - (2 + 2) = 7 inches high.

Depicted graphically (sort of), here's what we've created:

-----------------

| ------------- |

| | --------- | |

| | | xxx xxxx| | |

| | |xx xxx xx| | |

| | |xxx x xxx| | |

| | |x xx xxxx| | |

| | | xxxx xxx| | |

| | |xx xxx xx| | |

| | |xxx x xxx| | |

| | --------- | |

| ------------- |

-----------------

The smallest box, in the middle, is the text block, 5.5 by 7 inches.

Out from that is the trim size (page size), 7.5 by 9 inches.

And the outside border, of course, is the paper size, 8.5 by 11 inches.

You can adjust the position of header and footer on the page by modifying their paragraph style to include space before and after as needed.

If you need to add crop marks, you should check out our WordSetter program, which will create them for you--yes, even in Microsoft Word:

http://www.editorium.com/14000.htm

http://www.editorium.com/wordsetter/TH_10.htm

Thanks to Dorian Cougias for suggesting this topic.

_________________________________________

READERS WRITE

Autoformatting in Tables

Steve Hudson wrote:

You cannot add new table types to the Table Autoformat list, nor can you edit existing ones.

HOWEVER, you *can* get many more autoformat layouts by setting various properties of the autoformat to FALSE. For example, setting them all to false for table style normal gives you an invisible (borderless) table. By using grid with no first column, font changes, or first row, you get a nice boxed grid. You can also set the default line width for tables as well.

Anything more complex has to be handled by first inserting a table and then styling it up via macro. Ninety-nine out of a hundred times, this is accomplished by styling the table, then styling the first row, then styling the first column.

In Word XP we get table styles (they are making styles even more abominable by giving us different flavours), so I am guessing it MAY be possible to set autoformats of your own to a greater degree. Until they work properly however, I ain't investigating them.

An extract from my Word Spellbook (available from http://www.wordheretic.com):

Auto-formatting or custom default tables (Word 2000)

Highly specialized custom formats are difficult (when not impossible) to do. Very simple formats are somewhat easier and more likely to succeed. If you are prepared to bend your style guide to what is possible and what is not, you can get some satisfactory automatic results without having to resort to macros.

You can also just use Autotext entries to store a pre-formatted table in--you can drag these Autotext entries onto a menu if required.

As a quick aside, when dealing programmatically with tables, there are two subtle tricks. One is the .range.cells(n) object that serializes all the cells in the selected range. This tables(1).range.cells(k) is an easier way to address the collection. Secondly, you can do groovy table stuff via the selection object that you can't via a range . . .

Back to the plot. Inspect the Insert Table > AutoFormat dialog. To get all your tables inserted with invisible borders, select the simple 1 format and CLEAR all the little checkboxes. Viola! Problem solved.

Whatever rows and columns I give it will be the default that is used from there on in when I tick "Use this as the default style."

To extend this concept, you are NOT limited to JUST the formats presented. You can also use just parts of them! We did this above and used NO PARTS to give us NO styling--or an auto-invisible table.

For example, many people could get away with Grid 5 with the font option cleared to make themselves a nice grid.

________________________

Rob Little wrote:

First, in 2002, any new table style becomes a table autoformat (it shows up in the table autoformat dialog).

Second, as for basing your table style on an empty style, use "Table Normal." Table normal is statically defined (like "Default Paragraph Font" for character styles) and cannot be changed by any user. (This is different than the "Normal" paragraph style which can be edited by users). "Table Grid" is the default table style because it includes a Grid border (among other things).

Here are a couple of things to know about table styles:

* Table styles cannot define "structural" elements of a table (merging of cells, etc). This means they cannot define the width of cells or the height of rows (changing the width of cells from row to row implicitly merges cells, for example).

* All table autoformats in Word 2002 are Table Styles and can be customized by the user. (New styles can be based on them, too.) Conversely, this means that anything you see done in the table autoformats can be built from scratch through the table styles user interface.

* Table styles define character, paragraph, and table/cell/row properties. These properties are evaluated *before* the paragraph style's properties (order of calculation is TableStyle + ParaStyle + CharStyle + DirectFormatting = Calculated Properties). For example: If your table style defines the "Whole Table" as being "Arial," then you'll see "Arial." If you apply a paragraph style that applies "Courier," then "Arial" + "Courier" = "Courier" (because the paragraph style wins).

* When you apply a table style, the character and paragraph properties of the table are *not* reset. This is different than paragraph styles, which reset the character properties of the paragraph before applying the paragraph (with some exceptions). So, if you have a table which has a lot of direct character/paragraph formatting (for example, fonts, sizes, justification, and so on), and you apply a Table Style (or Table Autoformat), that direct formatting will still be there (and will beat any table style properties). If you want to get rid of that direct formatting, select the table and choose Edit/Clear/Formats (or click Clear Formatting on the Styles and Formatting taskpane). This will reset the table contents to just the table style. (You can clear formatting before or after applying the table style; it makes no difference.)

* When you use the "Applies to" part of table styles (this is used in almost all autoformats), you are telling Word to run a set of rules against your table when it applies formatting. For example, if you tell it to format the "First Row" with bold text, then every time the table changes, Word makes sure to format the first row with bold text--even if a new first row is inserted. In Word 2000 and before, the table autoformats were "static"--once the last row (for example) was formatted, if you inserted a new last row, you would end up with 2 rows looking like the last row. In 2002, the last row recalculates, and you get just one last row. This allows things like banding (every other row shaded, for example).

* There is a delicate interaction between table styles and the Normal style. Recall that the paragraph style is applied on top of the table style. This means that any formatting you have in your Normal style will almost always override your table style formatting (I say almost because not *all* styles are based on "Normal"). For example: If your Normal style has "Arial" in it, and you apply a table style that defines "Courier," you will see Arial. The paragraph style wins. There is a way around this, but it would take some space to explain, so I'll provide the solution if people ask for it (or I could leave it as an exercise for the reader). Are there really people still reading at this point?

* Because of the interaction between the Normal style and table styles, there is an even more delicate interaction with font sizes. First, you just plain can't force the table to use 10-point text. If you set the table style to 10 points, it won't apply it. I really don't want to try to explain why.

That's all I can think of off the top of my head. I think everything above is accurate, but it's late and its entirely possible that I flubbed something up. Feel free to send me comments or corrections and I'll incorporate them.

Thanks to Steve and Rob for all of this useful information.

_________________________________________

RESOURCES

I recently changed my Web browser's home page from Google to Refdesk.com. Why? Because Refdesk includes a Google search box *and* a bunch of useful references that I've already had occasion to use in editing. Maybe you'll find it helpful too:

http://www.refdesk.com/instant.html

Microsoft's Font Properties Extension

If you work a lot with fonts, you'll probably be interested in Microsoft's Font Properties Extension, which will run under Windows 95 or higher (sorry, Mac users).

The extension makes it possible to display information about a font's origin, copyright, and licensing; its hinting and smoothing; whether or not the font can be embedded in a document; and perhaps most important, the font's character set.

Microsoft notes that the extension also "includes version and date information" and "describes the font in terms of number of glyphs, number of kerning pairs, the possible existence of a euro symbol, and the presence of embedded bitmaps within the font."

It's a useful tool, and best of all, it's free. You can learn more here:

http://www.microsoft.com/typography/property/property.htm

_________________________________________

READERS WRITE

John Renish wrote to say that the previous newsletter's macro to fix all caps in text didn't run. I offer my apologies and the corrected macro:

'THE MACRO STARTS HERE
Sub FixAllCapsInText()
'Macro created 10/26/2003 by Jack M. Lyon
'
Selection.Find.ClearFormatting
With Selection.Find
.Text = "[A-Z]{2,}"
.Replacement.Text = ""
.Forward = True
.Wrap = wdFindStop
.Format = False
.MatchCase = False
.MatchWholeWord = False
.MatchAllWordForms = False
.MatchSoundsLike = False
.MatchWildcards = True
End With
Selection.Find.Execute
While Selection.Find.Found = True
Selection.Range.Case = wdTitleWord
Select Case Selection.Range
Case "A", "An", "As", "At", "And", "But", _
"By", "For", "From", "In", "Into", "Of", _
"On", "Or", "Over", "The", "Through", _
"To", "Under", "Unto", "With"
Selection.Range.Case = wdLowerCase
Case "Usa", "Nasa", "Usda", "Ibm", "Nato"
Selection.Range.Case = wdUpperCase
End Select
Selection.MoveRight Unit:=wdCharacter, Count:=1
Selection.Find.Execute
Wend
MsgBox "Finished!", , "Fix All Caps in Text"
End Sub
'THE MACRO ENDS HERE

If you don't know how to use macros like that one, you can learn how here.

For more information, see our previous newsletter:

http://www.topica.com/lists/editorium/read/message.html?mid=1715274489

Thanks, John.

_________________________________________

RESOURCES

Microsoft's Disagreeably Facetious Type Glossary is hopelessly old-fashioned, unabashedly British, and a delight to read, and it will reveal the meaning of typography terms that you never even knew existed.

http://www.microsoft.com/typography/glossary/content.htm

Unlinking Headers and Footers

If you work much with headers and footers in Microsoft Word, you've probably noticed that when you insert a section break, it's automatically set to "Same as Previous." For many Word users, especially in the publishing world, this is an annoyance. If I'm creating a new chapter in a book, I want my headers to be *different* from those in the previous chapter, not the same. Ideally, there should be a way to set this (or not) as an option:

[] Automatically set headers and footers to "Same as Previous"

As far as I know, this option doesn't exist. If you know otherwise, please let me know, and I'll include your tip in next week's newsletter. In the meantime, here's a macro that will go through a document's headers and footers and unlink them all:

'THE MACRO STARTS HERE
Dim curSection As Section, curHeader As HeaderFooter
For Each curSection In ActiveDocument.Sections
For Each curHeader In curSection.Headers
curHeader.LinkToPrevious = False
Next curHeader
Next curSection
'THE MACRO ENDS HERE

If you don't know how to use macros like that one, you can learn here.

The macro won't set the option permanently, but at least you'll no longer need to unlink all of your headers and footers by hand.

Thanks to Susan Bullowa for suggesting this topic.

_________________________________________

READERS WRITE

I've received so many great tips from readers that I'm simply not able to include everything in this week's newsletter. My apologies to those who are still waiting; in some cases, I'm holding messages to go with a specific article, so they may not show up for several weeks. Thanks for your patience. And please, keep those tips coming!

After reading last week's exchange about automatic formatting in Word's Document Map, Hilary Powers wrote:

After describing the problem of non-heading items showing up in the Document Map, and restoring themselves to it after being removed by a VBA routine, Bill Fosbury wrote:

I will

* run my program to change the outline level to "body text"

* then add the headings that I want

* the next time I open the file the document map builder will be happy

the next time I open the file because I will have some heads and I will

therefore not look for candidates.

To which Jack Lyon writes:

What you've said makes perfect sense. Document Map can't work unless it

has something to work with. So if it doesn't, rather than just sit there

blank (and leave the user wondering why nothing's happening), it

autoformats the document so it *will* have something to work with....

That's certainly the way it looks like it's working, or at least ought to work, but it's not. At any rate, I too have a charming little VBA routine that gets rid of garbage on the Document Map. And when I save the file and reopen it, all the garbage is back. But. Garbage appears unpredictably, whether or not a file has headings defined using Word styles, and as far as I can tell, once a file develops that sort of garbage it never loses it permanently. Meanwhile, if no lines in a file look like headings, the Document Map will show up with nothing in it; it doesn't have to find something and put it up. Nobody's found a fix for this over on the Word-PC list; I've just got the cleaner on a hot key and run it as needed.

I'm not certain if the Document Map ever lets go of stuff it's defined--it may well do so, as many files never develop garbage. But just defining a set of headings is no guarantee that it won't go its own cluttered way....

_____________________

After reading last week's article on automatic dashes, my longtime friend Richard O'Regan wrote:

I disagree with you on the use of the en dash. At least, over here, in "old Europe," the en dash, preceded and followed by a space, is used both in the UK and on the Continent. The em dash no longer sees the light of day.

_____________________

Lindy M wrote:

I share your preference for the em dash rather than the spaced en, but I do use the latter, for one reason: it's easier to control line breaks. With a nonbreaking space before the en dash I can ensure it always breaks at the end of a line rather than at the beginning. I can't find any way to attach an em dash to the end of a word other than spacing it, which would look excessive to my eye. I might add that some people firmly believe that dashes should appear at the beginning of a line, not the end. I don't. I should also confess that I use Word as little as possible. From what I've seen it handles em dash breaks elegantly, but some other word processors and page layout programs need help with this sort of thing, especially older versions.

Thanks to all for their comments and suggestions.

_________________________________________

RESOURCES

I've been almost in despair about what to do about spam. Then here comes an email message from my Aussie mate Steve Hudson, who pointed me to Cloudmark's free SpamNet program:

http://www.cloudmark.com/products/spamnet/

Unlike other such programs, this one uses the power of distributed computing to identify and block spam. It's not foolproof, but it's sure caught an awful lot of spam for me. Check it out!

Hyphenation Exception Dictionary

A few weeks ago, I mentioned that I'd been working on a long, complex book that had to be typeset in Microsoft Word. I learned a lot from the experience, and I'll be passing on some of that hard-won knowledge in future issues. As I worked on the book, one problem quickly became apparent: Microsoft Word has no hyphenation exception dictionary. A hyphenation exception dictionary is a list of words that specifies how certain words should (or should not) be broken at the end of a line. For example, a really tiny hyphenation exception dictionary might include the following entries as words that shouldn't be broken at all:

people

little

create

It might also include the following words, with optional hyphens indicating breaking points:

con-vert-ible (not con-ver-ti-ble)

tan-gible (not tang-i-ble)

tri-angle (not trian-gle)

Microsoft Word will break all of those words badly.

Dedicated typesetting programs such as QuarkXPress will automatically check a hyphenation exception dictionary (if you've provided one) and break words accordingly. Microsoft Word won't, but there is a way around the problem. First, compile your hyphenation exception list. Then record a macro that finds each word on your list and replaces it with the same word including optional hyphens and zero-width nonbreaking spaces as needed. You can learn more about zero-width nonbreaking spaces here:

http://www.topica.com/lists/editorium/read/message.html?mid=1711888513

And you can learn more about optional hyphens here:

http://www.topica.com/lists/editorium/read/message.html?mid=1711932079

Using the words above, our list might look like this (I'm using a hyphen [-] to represent optional hyphens and a tilde [~] to represent a zero-width nonbreaking space):

peo~ple

lit~tle

cre~ate

con-vert-ible

tan-gible

tri-angle

So you'd find "people" and replace it with "peo~ple," "triangle" and replace it with "tri-angle," and so on. Then, when Word does its automatic hyphenation, the words will break in the way you've specified rather than in the (incorrect) way Microsoft Word uses by default (using, in my case, American English rules). It's not that Word does a bad job of hyphenation, mind you. It's actually pretty good. But even the best hyphenation algorithms need a little help.

A more elegant (and probably more reliable) way of preventing breaks is to mark the words in question so that they are not "proofed"--that is, so that they won't be checked for spelling, grammar, or (most important) hyphenation. To do that, select a word, click Tools > Language > Set Language, and put a check in the checkbox labeled "Do not check spelling or grammar." This has the advantage of not introducing an invisible character into the word, which will keep an unwanted space from showing up later if you use the document to create a Web page, an ebook, or whatever.

A better way than recording all of these words in a macro is to use our RazzmaTag program, which will run your hyphenation exception list on a whole folder full of documents at one time. It will also let you edit and add to your list as needed. I've prepared preliminary versions of such lists that you can download and play with. The one using the zero-width nonbreaking space is here (this list will work with MegaReplacer as well as RazzmaTag):

http://www.editorium.com/ftp/nonbreakinglist.zip

And the one marking the words so they won't be proofed is here (RazzmaTag only):

http://www.editorium.com/ftp/noproofinglist.zip

But what I'd really like is for you to send me any hyphenation exception lists you already have (maybe check with your typesetter). Then I'll merge them and include the comprehensive list in next week's newsletter! Come on--what do you say? Please email your lists to mailto:editor [at symbol] editorium.com.

You can learn more about RazzmaTag here:

http://www.editorium.com/razzmatag.htm

And you can learn more about MegaReplacer here:

http://www.editorium.com/14843.htm

_________________________________________

READERS WRITE

After reading the article on finding and replacing weird WordPerfect characters wrote:

I work with Word 98 on a Mac and occasionally tangle with WP documents. A few days ago, a writer sent a WP file to me. I opened it in Word, as Text, and found lots of gibberish--my favorite character being the letter Y with two dots over it: . This character was interspersed between *every* legitimate letter and space. (I would see, basically, this: letter and space.) Copying it and using Find and Replace was fruitless. Word refused to cooperate.

I decided to try opening the file with Word, but not as ASCII Text. Opening it as RTF gave me the same results, as did opening it as a Word Document. But I then tried an option called "Recover Text from Any File," and the document opened with text that was absolutely clean. I mean *really* clean.

The only caveat I can think of is that there was no special formatting in this file. I'm responsible for formatting the document and sending it on to my editor.

Yateendra Joshi (yateen@teri.res.in) wrote:

Thank you for the interesting and useful article on ellipses in the 16 January 2001 issue of Editorium Update.

Most often, ellipses stand for omitted matter, and the dots will represent it even better if they do not sit on the line but are raised a bit, say to the centre of the letter x (lowercase eks). The extent to which the dots should be raised will depend on the font (raising by 2 points works best with 11-point Georgia). The sequence is therefore to type the dots as you explain, then select them, and raise them by Format > Font > Character Spacing > Position > Raised By followed by typing in the appropriate value. It helps to see the text enlarged by 500%.

Thanks to Fran and Yateendra for the great tips!

Break That Word Here!

Last week's newsletter explained how to use a zero-width nonbreaking space to keep a word from breaking at the end of a line when hyphenation is turned on (Tools > Language > Hyphenation > Automatically hyphenate document). Fine as far as it goes. But what can you do to break a word at a place other than one Microsoft Word insists on using? For example, Word will happily break "convertible" as "converti-ble." Ugh. (See your favorite style manual for more information about how to break words properly; I prefer The Chicago Manual of Style.)

The solution is to insert an optional hyphen at any acceptable breaking points. In "convertible," for example, you could insert optional hyphens as follows: con-vert-ible. The optional hyphens will override word's automatic hyphenation and break the word at one of the points you've specified.

To get an optional hyphen, click Insert > Symbol > Special Characters > Optional hyphen. Or, easier yet, press CTRL + - (on a Macintosh press COMMAND + -).

In our shop, proofreaders check galleys for bad breaks, which are then corrected manually by our typesetters, who insert optional hyphens as needed (although usually in QuarkXPress rather than Word). Wouldn't it be nice if there was a way to insert optional hyphens automatically? As it turns out, there is--even in Microsoft Word.

Stay tuned; next week I'll tell you all about it.

You can learn more about The Chicago Manual of Style here:

http://www.press.uchicago.edu/cgi-bin/hfs.cgi/00/12245.ctl

And you can see the FAQ here:

http://www.press.uchicago.edu/Misc/Chicago/cmosfaq/cmosfaq.html

_________________________________________

READERS WRITE

After reading last week's article about how to use a zero-width nonbreaking space to keep a word from breaking, Patsy Price sent a tip about an elegant alternative:

I too have been very frustrated when specific words insisted on breaking in Word 98 (Mac) whether I wanted them to or not. I tried everything I could think of, including inserting a nonbreaking hyphen before the word, but nothing worked. Then somebody on one of the lists I belong to made a suggestion that has worked for me so far: select the word and change the language to No Proofing [Tools > Language > Do not check spelling or grammar]. Even when the file is opened in Word 2000 PC the word doesn't hyphenate.

Patsy made the effort to track down the person who originally made the suggestion, H?l?ne Dion on the McEdit list. So thanks to H?l?ne for the tip and to Patsy for passing it on.

Bill Rubidge (wbr@aya.yale.edu) sent the following tip on how to make a zero-width nonbreaking hyphen in Word 97, along with a brilliant wildcard find-and-replace routine to keep words together at the end of a paragraph.

Interesting zero-width action. In my case I wanted to break long URLs in a narrow text column. Unfortunately, I am still using Word 97, so I had to resort to a conditional hyphen solution, but I set the hyphen size to 1 point and colored it white to hide it.

In any case, my experience on that issue and your description of the one below made me think you could take your "Don't break that word" solution a step further. I never use hyphenation, so I don't have your issue, but I dislike short words ending up all by their lonesome as the final line of a paragraph. My solution is:

Search for:

([A-Za-z0-9,.$?;:'"")!*]{1,8}) ([A-Za-z0-9,.$?;:'"")!*]{1,8})[^013]

Replace with:

1^s2

This forces the last two words (up to eight characters long) to be on the last line together.

Your hyphenation problem seems similar, but I shudder at the thought of inserting the Unicode characters manually. Would it do the job for you to search for the end of a paragraph and then insert the nonbreaking zero-width space character between EVERY letter of the last word? This way, you could run this macro automatically for the whole document.

By the way, I found I had to do an additional undo search to take out these things where I knew that the item was part of a small column. For example, if the found item was in a table, I would undo the nonbreaking material, as the table columns might be too narrow for this to be appropriate.

Thanks to Bill for the great tips.

_________________________________________

RESOURCES

Possibly the ultimate treatise on the subject, the fascinating book Hyphenation, by Ronald McIntosh and David Fawthrop, is available free online:

http://www.hyphenologist.co.uk/book/BOOK-ED3.HTM