36:
245:
312:
file, but that it has no information beyond "the text itself"—no representation of bold or italics, paragraph, page, chapter, or footnote boundaries, etc. Michael S. Hart, for example, argued that this "is the only text mode that is easy on both the eyes and the computer". Hart made the correct point
338:
In consequence of this, such texts cannot be reliably re-formatted. A program cannot reliably tell where footnotes, headers or footers are, or perhaps even paragraphs, so it cannot re-arrange the text, for example to fit a narrower screen, or read it aloud for the visually impaired. Programs might
334:
might be omitted, or might simply appear as additional lines of text, perhaps with blank lines before and after (or not). An ornate separator line might be represented instead by a line of asterisks (or not). Chapter and sections titles, likewise, are just additional lines of text: they might be
313:
that proprietary word-processor formats made texts grossly inaccessible; but that is irrelevant to standard, open data formats. The narrow sense of "e-text" is now uncommon, because the notion of "just vanilla ASCII" (attractive at first glance), has turned out to have serious difficulties:
374:
output usually produces more information than this, such as the use of bold and italic. If this information is not kept, it is expensive and time-consuming to reconstruct it; more sophisticated information such as what edition you have, may not be recoverable at all.
382:, spaces, tabs, and the like: Spaces between words; two returns and 5 spaces for paragraph. The main difference from more formal markup is that "plain texts" use implicit, usually undocumented conventions, which are therefore inconsistent and difficult to recognize.
385:
The narrow sense of e-text as "plain vanilla ASCII" has fallen out of favor. Nevertheless, many such texts are freely available on the Web, perhaps as much because they are easily produced as because of any purported portability advantage. For many years
355:
relating to the text is sometimes included with an e-text, but there is by this definition no way to say whether or where it is preset. At best, the text of the title page might be included (or not), perhaps with centering imitated by indentation.
350:
the work. For example, is it the first or the tenth edition? Who prepared it, and what rights do they reserve or grant to others? Is this the raw version straight off a scanner, or has it been proofread and corrected?
186:
information, or not. An e-text may be an electronic edition of a work originally composed or published in other media, or may be created in electronic form originally. The term is usually synonymous with
335:
detectable by capitalization if they were all caps in the original (or not). Even to discover what conventions (if any) were used, makes each book a new research or reverse-engineering project.
367:, or even the simplest tables. This leads to endless practical problems: for example, if the computer cannot reliably distinguish footnotes, it cannot find a phrase that a footnote interrupts.
320:
or the accented vowels used in many
European languages cannot be represented (unless awkwardly and ambiguously as "~n" "a'"). Asian, Slavic, Greek, and other writing systems are impossible.
464:
326:
Third, "e-texts" in this narrow sense have no reliable way to distinguish "the text" from other things that occur in a work. For example, page numbers,
323:
Second, diagrams and pictures cannot be accommodated, and many books have at least some such material; often it is essential to the book.
452:
Reading and
Writing the Electronic Book. Nicole Yankelovich, Norman Meyrowitz, and Andries van Dam. IEEE Computer 18(10), October 1985.
433:
231:, and other information in their texts, as well as in some cases (such as FRESS) supporting not just text but also graphics.
154:, and especially a document that is mainly text. For example, a computer-based book of art with minimal text, or a set of
491:
301:
In some communities, "e-text" is used much more narrowly, to refer to electronic documents that are, so to speak, "plain
288:
119:
100:
270:
390:
strongly favored this model of text, but with time, has begun to develop and distribute more capable forms such as
72:
346:
Fourth, and a perhaps surprisingly important issue, a "plain-text" e-text affords no way to represent information
17:
548:
255:
57:
79:
371:
359:
Fifth, texts with more complicated information cannot really be handled at all. A bilingual edition, or a
86:
482:
203:, have been around since long before the Internet, the Web, and specialized E-book reading hardware.
428:
68:
266:
53:
46:
543:
216:
175:
8:
302:
200:
147:
316:
First, this narrow type of "e-text" is limited to the
English letters. Not even Spanish
553:
505:
423:
387:
379:
509:
93:
495:
413:
364:
360:
418:
224:
179:
262:
183:
537:
223:
appeared in the 1960s. These early systems made extensive use of formatting,
159:
204:
151:
453:
327:
171:
163:
137:
526:
500:
477:
476:
Coombs, James H.; Renear, Allen H.; DeRose, Steven J. (November 1987).
340:
309:
228:
167:
155:
403:
212:
273:. Statements consisting only of original research should be removed.
35:
378:
If actuality, even "plain text" uses some kind of "markup"—usually
352:
331:
208:
408:
188:
162:, would not usually be called an "e-text". An e-text may be a
305:
220:
478:"Markup systems and the future of scholarly text processing"
391:
317:
211:
in the 1940s, while large-scale electronic text editing,
343:
to guess at the structure, but this can easily fail.
308:". By this is meant not only that the document is a
60:. Unsourced material may be challenged and removed.
475:
535:
363:with footnotes, commentary, critical apparatus,
234:
528:Scholarly Electronic Publishing Bibliography
207:began developing an electronic edition of
499:
434:L'Association des Bibliophiles Universels
289:Learn how and when to remove this message
120:Learn how and when to remove this message
27:Any document that is read in digital form
215:, and online reading platforms such as
14:
536:
454:http://dl.acm.org/citation.cfm?id=4407
238:
58:adding citations to reliable sources
29:
24:
25:
565:
520:
194:
243:
227:, automatic tables of contents,
34:
45:needs additional citations for
469:
458:
446:
13:
1:
439:
146:) is a general term for any
7:
397:
269:the claims made and adding
10:
570:
483:Communications of the ACM
429:Distributed Proofreaders
142:"; sometimes written as
170:file, viewed with any
549:Electronic publishing
178:. An e-text may have
201:electronic documents
176:proprietary software
54:improve this article
501:10.1145/32206.32209
380:control characters
254:possibly contains
424:Online Books Page
388:Project Gutenberg
370:Even raw scanner
299:
298:
291:
256:original research
235:"Just plain text"
130:
129:
122:
104:
16:(Redirected from
561:
514:
513:
503:
473:
467:
462:
456:
450:
414:Electronic paper
365:cross-references
361:critical edition
294:
287:
283:
280:
274:
271:inline citations
247:
246:
239:
150:that is read in
125:
118:
114:
111:
105:
103:
62:
38:
30:
21:
569:
568:
564:
563:
562:
560:
559:
558:
534:
533:
523:
518:
517:
474:
470:
465:Michael S. Hart
463:
459:
451:
447:
442:
419:Digital library
400:
295:
284:
278:
275:
260:
248:
244:
237:
197:
126:
115:
109:
106:
63:
61:
51:
39:
28:
23:
22:
18:Electronic text
15:
12:
11:
5:
567:
557:
556:
551:
546:
532:
531:
522:
521:External links
519:
516:
515:
468:
457:
444:
443:
441:
438:
437:
436:
431:
426:
421:
416:
411:
406:
399:
396:
297:
296:
251:
249:
242:
236:
233:
196:
195:E-text origins
193:
160:scans of pages
128:
127:
42:
40:
33:
26:
9:
6:
4:
3:
2:
566:
555:
552:
550:
547:
545:
544:Books by type
542:
541:
539:
530:
529:
525:
524:
511:
507:
502:
497:
493:
489:
485:
484:
479:
472:
466:
461:
455:
449:
445:
435:
432:
430:
427:
425:
422:
420:
417:
415:
412:
410:
407:
405:
402:
401:
395:
393:
389:
383:
381:
376:
373:
368:
366:
362:
357:
354:
349:
344:
342:
336:
333:
329:
324:
321:
319:
314:
311:
307:
304:
293:
290:
282:
272:
268:
264:
258:
257:
252:This section
250:
241:
240:
232:
230:
226:
222:
218:
214:
210:
206:
202:
192:
190:
185:
181:
177:
173:
169:
165:
161:
157:
153:
149:
145:
141:
139:
134:
124:
121:
113:
102:
99:
95:
92:
88:
85:
81:
78:
74:
71: –
70:
66:
65:Find sources:
59:
55:
49:
48:
43:This article
41:
37:
32:
31:
19:
527:
487:
481:
471:
460:
448:
384:
377:
369:
358:
347:
345:
337:
328:page headers
325:
322:
315:
300:
285:
276:
253:
205:Roberto Busa
199:E-texts, or
198:
152:digital form
143:
136:
132:
131:
116:
110:January 2013
107:
97:
90:
83:
76:
64:
52:Please help
47:verification
44:
494:: 933–947.
172:open source
156:photographs
538:Categories
440:References
341:heuristics
310:plain text
279:April 2015
263:improve it
229:hyperlinks
184:formatting
168:plain text
138:electronic
80:newspapers
554:Documents
404:Text file
332:footnotes
267:verifying
213:hypertext
182:or other
510:59941802
398:See also
353:Metadata
148:document
69:"E-text"
303:vanilla
261:Please
217:Augment
209:Aquinas
135:(from "
94:scholar
508:
490:(11).
409:e-book
339:apply
330:, and
225:markup
189:e-book
180:markup
164:binary
133:e-text
96:
89:
82:
75:
67:
506:S2CID
348:about
306:ASCII
221:FRESS
166:or a
144:etext
101:JSTOR
87:books
392:HTML
219:and
140:text
73:news
496:doi
492:ACM
372:OCR
265:by
174:or
158:or
56:by
540::
504:.
488:30
486:.
480:.
394:.
191:.
512:.
498::
318:ñ
292:)
286:(
281:)
277:(
259:.
123:)
117:(
112:)
108:(
98:·
91:·
84:·
77:·
50:.
20:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.