393:
process of going through intermediary stages, or involving complex "exporting" and "importing" procedures, which may include converting to and from a tab-delimited or comma-separated text file. In some cases, a program may recognize several data file formats at the data input stage and then is also capable of storing the output data in several different formats. Such a program may be used to convert a file format. If the source format or target format is not recognized, then at times a third program may be available which permits the conversion to an intermediate format, which can then be reformatted using the first program. There are many possible scenarios.
36:
638:, is one example. Using a WYSIWYG HTML editor conflates the two paradigms, and the result is HTML files with suboptimal, if not nonstandard, code. In the WYSIWYG paradigm a double linebreak signifies a new paragraph, as that is the visual cue for such a construct, but a WYSIWYG HTML editor will usually convert such a sequence to <BR><BR>, which is structurally no new paragraph at all. As another example, converting from
584:
necessarily involves loss of formatting information, because plain text format does not support word processing constructs such as marking a word as boldface. For this reason, conversion from one format to another which does not support a feature that is important to the user is rarely carried out, though it may be necessary for interoperability, e.g. converting a file from one version of
642:
to an editable word processor format is a tough chore, because PDF records the textual information like engraving on stone, with each character given a fixed position and linebreaks hard-coded, whereas word processor formats accommodate text reflow. PDF does not know of a word space character—the
583:
The objective of data conversion is to maintain all of the data, and as much of the embedded information as possible. This can only be done if the target format supports the same features and data structures present in the source file. Conversion of a word processing document to a plain text file
367:
is predicated on certain standards for data and file handling. Furthermore, each computer program handles data in a different manner. Whenever any one of these variables is changed, data must be converted in some way before it can be used by a different computer, operating system or program. Even
534:
first and from that to
Windows-1251. This is a more manageable approach; rather than needing lookup tables for all possible pairs of character encodings, an application needs only one lookup table for each character set, which it uses to convert to and from Unicode, thereby scaling the number of
392:
There are many ways in which data is converted within the computer environment. This may be seamless, as in the case of upgrading to a newer version of a computer program. Alternatively, the conversion may require processing by the use of a special conversion program, or it may involve a complex
455:
pixel values with zeroes, so that a pixel with a value of FF C3 56, for example, becomes FF00 C300 5600. The conversion makes it possible to change a pixel to have a value of, for instance, FF80 C340 56A0, but the conversion itself does not do that, only further
488:) format only wastes space, since the same image with its loss of original information (the artifacts of lossy compression) becomes the target. A JPEG image can never be restored to the quality of the original image from which it was made, no matter how much the user tries the "
562:
directly; instead, when loading the PCX image, it decodes it to a simple bitmap format for internal use in memory, and when commanded to convert to PNG, that memory image is converted to the target format. An audio converter that converts from
643:
space between two letters and the space between two words differ only in quantity. Therefore, a title with ample letter-spacing for effect will usually end up with spaces in the word processor file, for example INTRODUCTION with spacing of 1
372:
from one format to another, usually for the purpose of application interoperability or of the capability of using new features, is merely a data conversion. Data conversions may be as simple as the conversion of a
442:
since it is rule-based; whereas the addition of color information to a grayscale image cannot be reliably done programmatically, as it requires adding new information, so any attempt to add color would require
538:
Pivotal conversion is similarly used in other areas. Office applications, when employed to convert between office file formats, use their internal, default file format as a pivot. For example, a
693:
659:
will be needed to carry out conversion. Reverse engineering can achieve close approximation of the original specifications, but errors and missing features can still result.
655:
Successful data conversion requires thorough knowledge of the workings of both source and target formats. In the case where the specification of a format is unknown,
506:
Because of these realities of computing and information theory, data conversion is often a complex and error-prone process that requires the help of experts.
755:
603:. Of course, this is not an optimal solution, and can impact operations like searching and copying; and if a language makes a distinction between
327:
588:
to an earlier version to enable transfer and use by other users who do not have the same later version of Word installed on their computer.
718:
514:
Data conversion can occur directly from one format to another, but many applications that convert between multiple formats use an
614:
Data conversion can also suffer from inexactitude, the result of converting between formats that are conceptually different. The
100:
765:
72:
591:
Loss of information can be mitigated by approximation in the target format. There is no way of converting a character like
401:
Before any data conversion is carried out, the user or application programmer should keep a few basics of computing and
79:
667:
Data format conversion can also occur at the physical layer of an electronic communication system. Conversion between
320:
119:
575:
data in memory first, and then performs the lossy AAC compression on that memory image to produce the target file.
53:
430:
image can easily be converted to grayscale, while the opposite conversion is a painstaking process. Converting a
169:
135:
86:
788:
57:
368:
different versions of these elements usually involve different data structures. For example, the changing of
202:
68:
729:
599:, since the ASCII standard lacks it, but the information may be retained by approximating the character as
439:
313:
724:
530:
using a lookup table between the two encodings, but the modern approach is to convert the KOI8-R file to
515:
222:
24:
419:
format does not add information; it merely makes room for that addition, which usually a human must do.
559:
518:
by way of which any source format is converted to its target. For example, it is possible to convert
473:
469:
448:
639:
274:
179:
500:
301:
193:
46:
806:
572:
568:
93:
359:
is built on the basis of certain standards, which requires that data contains, for example,
489:
174:
381:
system to another; or more complex, such as the conversion of office file formats, or the
8:
703:
656:
409:
Information can easily be discarded by the computer, but adding information takes effort.
280:
734:
688:
672:
619:
402:
378:
352:
238:
784:
761:
496:
457:
386:
356:
554:
and then that to WordPerfect format. An image conversion program does not convert a
19:
This article is about conversion of file formats. For conversion of data types, see
543:
438:(DOS/Windows) text file involves adding characters, but this does not increase the
382:
364:
269:
644:
519:
444:
243:
20:
753:
708:
698:
676:
585:
539:
481:
296:
264:
164:
800:
344:
551:
527:
427:
416:
184:
159:
154:
713:
694:
Comparison of programming languages (basic instructions)#Data conversions
547:
422:
Data stored in an electronic format can be quickly modified and analyzed.
348:
360:
456:
manipulation of the image can. Converting an image or audio file in a
451:
to a 48-bit one does not add information to it, it only pads existing
668:
435:
374:
35:
622:
applications, versus the structural-descriptive paradigm, found in
149:
615:
531:
447:
by the computer based on previous knowledge. Converting a 24-bit
635:
523:
465:
412:
The computer can add information only in a rule-based fashion.
212:
596:
495:
Automatic restoration of information that was lost through a
754:
Dragos-Anton
Manolescu; Markus Voelter; James Noble (2006).
611:, then that approximation does involve loss of information.
23:. For conversion of analog information to digital data, see
631:
623:
564:
492:
Removal" feature of his or her image manipulation program.
477:
461:
431:
627:
555:
485:
452:
369:
217:
207:
351:
to another. Throughout a computer environment, data is
499:
process would probably require important advances in
747:
578:
60:. Unsourced material may be challenged and removed.
647:as I N T R O D U C T I O N on the word processor.
650:
798:
760:. Addison-Wesley Professional. pp. 271–.
630:and many applications derived therefrom, like
321:
415:Upsampling the data or converting to a more
783:. Upper Saddle River, NJ: Addison-Wesley.
328:
314:
16:Conversion of digital data between formats
778:
535:tables down from hundreds to a few tens.
120:Learn how and when to remove this message
719:Distributed Data Management Architecture
618:paradigm, extant in word processors and
799:
781:Pattern Languages of Program Design 5
757:Pattern Languages of Program Design 5
509:
396:
679:can be accomplished when necessary.
58:adding citations to reliable sources
29:
355:in a variety of ways. For example,
13:
14:
818:
579:Lost and inexact data conversion
34:
571:decodes the source file to raw
45:needs additional citations for
662:
651:Open vs. secret specifications
550:file by converting the RTF to
1:
779:Manolescu, FirstName (2006).
740:
730:Source-to-source translation
7:
725:Code conversion (computing)
682:
516:intermediate representation
383:conversion of image formats
25:Analog-to-digital converter
10:
823:
18:
231:Techniques and transforms
480:) or uncompressed (like
405:in mind. These include:
302:Transformation languages
275:Extract, transform, load
194:Transformation languages
501:artificial intelligence
363:checks. Similarly, the
343:is the conversion of
175:Model transformation
54:improve this article
704:Data transformation
657:reverse engineering
281:Web template system
170:Data transformation
136:Data transformation
735:Presentation layer
689:Character encoding
620:desktop publishing
510:Pivotal conversion
403:information theory
397:Information basics
387:audio file formats
379:character encoding
239:Identity transform
767:978-0-321-32194-7
497:lossy compression
357:computer hardware
338:
337:
130:
129:
122:
104:
69:"Data conversion"
814:
794:
772:
771:
751:
365:operating system
330:
323:
316:
270:Data integration
132:
131:
125:
118:
114:
111:
105:
103:
62:
38:
30:
822:
821:
817:
816:
815:
813:
812:
811:
797:
796:
791:
776:
775:
768:
752:
748:
743:
685:
665:
653:
581:
542:may convert an
512:
434:text file to a
426:For example, a
399:
341:Data conversion
334:
260:Data conversion
244:Data refinement
126:
115:
109:
106:
63:
61:
51:
39:
28:
21:Type conversion
17:
12:
11:
5:
820:
810:
809:
789:
774:
773:
766:
745:
744:
742:
739:
738:
737:
732:
727:
722:
716:
711:
709:Data wrangling
706:
701:
699:Data migration
696:
691:
684:
681:
664:
661:
652:
649:
586:Microsoft Word
580:
577:
540:word processor
511:
508:
424:
423:
420:
413:
410:
398:
395:
336:
335:
333:
332:
325:
318:
310:
307:
306:
305:
304:
299:
297:Data wrangling
291:
290:
286:
285:
284:
283:
278:
272:
267:
265:Data migration
262:
254:
253:
249:
248:
247:
246:
241:
233:
232:
228:
227:
226:
225:
220:
215:
210:
205:
197:
196:
190:
189:
188:
187:
182:
177:
172:
167:
165:Data migration
162:
157:
152:
144:
143:
139:
138:
128:
127:
42:
40:
33:
15:
9:
6:
4:
3:
2:
819:
808:
807:Computer data
805:
804:
802:
795:
792:
786:
782:
769:
763:
759:
758:
750:
746:
736:
733:
731:
728:
726:
723:
720:
717:
715:
712:
710:
707:
705:
702:
700:
697:
695:
692:
690:
687:
686:
680:
678:
674:
670:
660:
658:
648:
646:
641:
637:
633:
629:
625:
621:
617:
612:
610:
606:
602:
598:
594:
589:
587:
576:
574:
570:
566:
561:
557:
553:
549:
545:
541:
536:
533:
529:
525:
521:
517:
507:
504:
502:
498:
493:
491:
490:JPEG Artifact
487:
483:
479:
475:
471:
467:
463:
460:format (like
459:
454:
450:
446:
441:
437:
433:
429:
421:
418:
414:
411:
408:
407:
406:
404:
394:
390:
388:
384:
380:
376:
371:
366:
362:
358:
354:
350:
346:
345:computer data
342:
331:
326:
324:
319:
317:
312:
311:
309:
308:
303:
300:
298:
295:
294:
293:
292:
288:
287:
282:
279:
276:
273:
271:
268:
266:
263:
261:
258:
257:
256:
255:
251:
250:
245:
242:
240:
237:
236:
235:
234:
230:
229:
224:
223:XML languages
221:
219:
216:
214:
211:
209:
206:
204:
201:
200:
199:
198:
195:
192:
191:
186:
183:
181:
178:
176:
173:
171:
168:
166:
163:
161:
158:
156:
153:
151:
148:
147:
146:
145:
141:
140:
137:
134:
133:
124:
121:
113:
110:November 2023
102:
99:
95:
92:
88:
85:
81:
78:
74:
71: –
70:
66:
65:Find sources:
59:
55:
49:
48:
43:This article
41:
37:
32:
31:
26:
22:
780:
777:
756:
749:
666:
654:
613:
608:
604:
600:
592:
590:
582:
552:OpenDocument
537:
528:Windows-1251
513:
505:
494:
425:
417:feature-rich
400:
391:
340:
339:
259:
252:Applications
185:Preprocessor
160:Data mapping
155:Data element
116:
107:
97:
90:
83:
76:
64:
52:Please help
47:verification
44:
714:Transcoding
663:Electronics
548:WordPerfect
790:0321321944
741:References
669:line codes
546:file to a
522:text from
445:estimation
428:true color
361:parity bit
80:newspapers
558:image to
436:Microsoft
377:from one
375:text file
347:from one
801:Category
683:See also
671:such as
520:Cyrillic
470:lossless
150:Metadata
142:Concepts
616:WYSIWYG
532:Unicode
468:) to a
440:entropy
353:encoded
289:Related
94:scholar
787:
764:
636:MathML
524:KOI8-R
472:(like
466:Vorbis
349:format
213:MOFM2T
96:
89:
82:
75:
67:
721:(DDM)
597:ASCII
458:lossy
277:(ETL)
180:Macro
101:JSTOR
87:books
785:ISBN
762:ISBN
675:and
634:and
632:HTML
624:SGML
607:and
565:FLAC
478:FLAC
462:JPEG
432:Unix
385:and
370:bits
73:news
673:NRZ
640:PDF
628:XML
595:to
573:PCM
569:AAC
567:to
560:PNG
556:PCX
544:RTF
526:to
486:WAV
484:or
482:BMP
476:or
474:PNG
464:or
453:RGB
449:PNG
218:QVT
208:AWK
203:ATL
56:by
803::
677:RZ
645:em
626:,
609:ae
601:ae
503:.
389:.
793:.
770:.
605:ä
593:ä
329:e
322:t
315:v
123:)
117:(
112:)
108:(
98:·
91:·
84:·
77:·
50:.
27:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.