Data conversion

This article is about conversion of file formats. For conversion of data types, see Type conversion. For conversion of analog information to digital data, see Analog-to-digital converter.

Data conversion is the conversion of computer data from one format to another. Throughout a computer environment, data is encoded in a variety of ways. For example, computer hardware is built on the basis of certain standards, which require that data contain, for example, parity bit checks. Similarly, the operating system is predicated on certain standards for data and file handling. Furthermore, each computer program handles data in a different manner. Whenever any one of these variables is changed, data must be converted in some way before it can be used by a different computer, operating system or program. Even different versions of these elements usually involve different data structures. For example, the changing of bits from one format to another, usually for the purpose of application interoperability or of the capability of using new features, is merely a data conversion. Data conversions may be as simple as the conversion of a text file from one character encoding system to another; or more complex, such as the conversion of office file formats, or the conversion of image formats and audio file formats.

There are many ways in which data is converted within the computer environment. This may be seamless, as in the case of upgrading to a newer version of a computer program. Alternatively, the conversion may require processing by the use of a special conversion program, or it may involve a complex process of going through intermediary stages, or involving complex "exporting" and "importing" procedures, which may include converting to and from a tab-delimited or comma-separated text file. In some cases, a program may recognize several data file formats at the data input stage and then is also capable of storing the output data in several different formats. Such a program may be used to convert a file format. If the source format or target format is not recognized, then at times a third program may be available which permits the conversion to an intermediate format, which can then be reformatted using the first program. There are many possible scenarios.
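
As an illustration of the "export" and "import" route through delimited text files, the following is a minimal Python sketch that converts comma-separated text to tab-delimited text using the standard csv module. The function name csv_to_tsv and the sample data are assumptions made for this example, not part of any particular program.

```python
import csv
import io


def csv_to_tsv(csv_text: str) -> str:
    """Convert comma-separated text to tab-delimited text.

    Hypothetical helper for illustration; it parses with the csv module
    so that quoted fields containing commas are handled correctly.
    """
    reader = csv.reader(io.StringIO(csv_text, newline=""))
    out = io.StringIO(newline="")
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    for row in reader:
        writer.writerow(row)
    return out.getvalue()


# Sample data (made up): the second field contains a comma inside quotes.
sample = 'name,comment\nAda,"likes commas, and quotes"\n'
print(csv_to_tsv(sample))
```

Parsing the input rather than replacing every comma with a tab matters here: a naive textual replacement would split the quoted field in two.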

Information basics

Before any data conversion is carried out, the user or application programmer should keep a few basics of computing and information theory in mind. These include:

Information can easily be discarded by the computer, but adding information takes effort.
The computer can add information only in a rule-based fashion.
Upsampling the data or converting to a more feature-rich format does not add information; it merely makes room for that addition, which usually a human must do.
Data stored in an electronic format can be quickly modified and analyzed.

For example, a true color image can easily be converted to grayscale, while the opposite conversion is a painstaking process. Converting a Unix text file to a Microsoft (DOS/Windows) text file involves adding characters, but this does not increase the entropy since it is rule-based; whereas the addition of color information to a grayscale image cannot be reliably done programmatically, as it requires adding new information, so any attempt to add color would require estimation by the computer based on previous knowledge. Converting a 24-bit PNG to a 48-bit one does not add information to it; it only pads existing RGB pixel values with zeroes, so that a pixel with a value of FF C3 56, for example, becomes FF00 C300 5600. The conversion makes it possible to change a pixel to have a value of, for instance, FF80 C340 56A0, but the conversion itself does not do that; only further manipulation of the image can. Converting an image or audio file in a lossy format (like JPEG or Vorbis) to a lossless (like PNG or FLAC) or uncompressed (like BMP or WAV) format only wastes space, since the same image with its loss of original information (the artifacts of lossy compression) becomes the target. A JPEG image can never be restored to the quality of the original image from which it was made, no matter how much the user tries the "JPEG Artifact Removal" feature of their image manipulation program.

Automatic restoration of information that was lost through a lossy compression process would probably require important advances in artificial intelligence.

Because of these realities of computing and information theory, data conversion is often a complex and error-prone process that requires the help of experts.
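
The two rule-based conversions mentioned in this section, padding 8-bit channel values to 16 bits and converting Unix line endings to DOS/Windows line endings, can be sketched in Python as follows. The function names are assumptions chosen for this example, and the pixel is treated as three raw bytes rather than as part of a real PNG file.

```python
def pad_8bit_to_16bit(pixel: bytes) -> bytes:
    """Widen each 8-bit channel to 16 bits by appending a zero byte."""
    return b"".join(bytes([channel, 0x00]) for channel in pixel)


def unix_to_dos(text: bytes) -> bytes:
    """Rule-based newline conversion: every LF becomes CR LF."""
    # Normalise existing CR LF pairs first so the rule is not applied twice.
    return text.replace(b"\r\n", b"\n").replace(b"\n", b"\r\n")


def dos_to_unix(text: bytes) -> bytes:
    """Inverse rule: every CR LF becomes LF."""
    return text.replace(b"\r\n", b"\n")


pixel = bytes.fromhex("FFC356")
wide = pad_8bit_to_16bit(pixel)
print(wide.hex(" ", 2).upper())  # FF00 C300 5600

# Dropping the padding bytes recovers the original pixel exactly,
# which shows that the wider form carries no additional information.
assert wide[::2] == pixel

# The newline rule adds bytes but is fully reversible.
assert dos_to_unix(unix_to_dos(b"line one\nline two\n")) == b"line one\nline two\n"
```

In both cases the output is longer than the input, yet the original can be recovered by a fixed rule, which is the sense in which no information has been added.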

Pivotal conversion

Data conversion can occur directly from one format to another, but many applications that convert between multiple formats use an intermediate representation by way of which any source format is converted to its target. For example, it is possible to convert Cyrillic text from KOI8-R to Windows-1251 using a lookup table between the two encodings, but the modern approach is to convert the KOI8-R file to Unicode first and from that to Windows-1251. This is a more manageable approach; rather than needing lookup tables for all possible pairs of character encodings, an application needs only one lookup table for each character set, which it uses to convert to and from Unicode, thereby scaling the number of tables down from hundreds to a few tens.

Pivotal conversion is similarly used in other areas. Office applications, when employed to convert between office file formats, use their internal, default file format as a pivot. For example, a word processor may convert an RTF file to a WordPerfect file by converting the RTF to OpenDocument and then that to WordPerfect format. An image conversion program does not convert a PCX image to PNG directly; instead, when loading the PCX image, it decodes it to a simple bitmap format for internal use in memory, and when commanded to convert to PNG, that memory image is converted to the target format. An audio converter that converts from FLAC to AAC decodes the source file to raw PCM data in memory first, and then performs the lossy AAC compression on that memory image to produce the target file.
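
The KOI8-R to Windows-1251 example can be sketched in Python, whose built-in codecs already use Unicode strings as the pivot. This is a minimal illustration, not a description of how any particular converter is implemented; the function names and the sample word are assumptions, and the table count assumes one bidirectional table per pair of encodings in the direct approach.

```python
def convert_via_unicode(data: bytes, source: str, target: str) -> bytes:
    """Convert encoded text from one character encoding to another."""
    text = data.decode(source)   # source encoding -> Unicode (the pivot)
    return text.encode(target)   # Unicode -> target encoding


def lookup_tables_needed(encodings: int) -> tuple[int, int]:
    """Return (tables for direct pairwise conversion, tables with a pivot)."""
    direct = encodings * (encodings - 1) // 2  # one table per pair
    pivot = encodings                          # one table per character set
    return direct, pivot


koi8_bytes = "Привет".encode("koi8_r")
cp1251_bytes = convert_via_unicode(koi8_bytes, "koi8_r", "cp1251")

# The byte sequences differ, but both decode to the same Unicode text.
assert koi8_bytes != cp1251_bytes
assert cp1251_bytes.decode("cp1251") == "Привет"

print(lookup_tables_needed(30))  # (435, 30)
```

With thirty character sets, pairwise conversion would need 435 tables while the pivot approach needs thirty, which matches the reduction from hundreds to a few tens described above.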
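
Lost and inexact data conversion

The objective of data conversion is to maintain all of the data, and as much of the embedded information as possible. This can only be done if the target format supports the same features and data structures present in the source file. Conversion of a word processing document to a plain text file necessarily involves loss of formatting information, because plain text format does not support word processing constructs such as marking a word as boldface. For this reason, conversion from one format to another which does not support a feature that is important to the user is rarely carried out, though it may be necessary for interoperability, e.g. converting a file from one version of Microsoft Word to an earlier version to enable transfer and use by other users who do not have the same later version of Word installed on their computer.

Loss of information can be mitigated by approximation in the target format. There is no way of converting a character like ä to ASCII, since the ASCII standard lacks it, but the information may be retained by approximating the character as ae. Of course, this is not an optimal solution, and can impact operations like searching and copying; and if a language makes a distinction between ä and ae, then that approximation does involve loss of information.

Data conversion can also suffer from inexactitude, the result of converting between formats that are conceptually different. The WYSIWYG paradigm, extant in word processors and desktop publishing applications, versus the structural-descriptive paradigm, found in SGML, XML and many applications derived therefrom, like HTML and MathML, is one example. Using a WYSIWYG HTML editor conflates the two paradigms, and the result is HTML files with suboptimal, if not nonstandard, code. In the WYSIWYG paradigm a double linebreak signifies a new paragraph, as that is the visual cue for such a construct, but a WYSIWYG HTML editor will usually convert such a sequence to <BR><BR>, which is structurally no new paragraph at all. As another example, converting from PDF to an editable word processor format is a tough chore, because PDF records the textual information like engraving on stone, with each character given a fixed position and linebreaks hard-coded, whereas word processor formats accommodate text reflow. PDF does not know of a word space character: the space between two letters and the space between two words differ only in quantity. Therefore, a title with ample letter-spacing for effect will usually end up with spaces in the word processor file, for example INTRODUCTION with spacing of 1 em as I N T R O D U C T I O N on the word processor.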
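
The approximation of ä as ae when converting to ASCII can be sketched as below. Only the ä to ae mapping comes from the text; the other table entries, the "?" fallback for unmapped characters and the function name to_ascii are assumptions added for illustration.

```python
# Approximation table. Only the first entry is taken from the text;
# the others are illustrative assumptions.
APPROXIMATIONS = {"ä": "ae", "ö": "oe", "ü": "ue"}


def to_ascii(text: str) -> str:
    """Convert text to ASCII, approximating characters ASCII lacks."""
    out = []
    for ch in text:
        if ch.isascii():
            out.append(ch)
        else:
            # Characters with no listed approximation become "?".
            out.append(APPROXIMATIONS.get(ch, "?"))
    return "".join(out)


print(to_ascii("Bär"))   # Baer
print(to_ascii("Baer"))  # Baer
```

Both inputs produce the same output, so the conversion cannot be reversed without outside knowledge. This is the loss that arises when a language distinguishes ä from ae, and it also explains why searching the converted text for the original spelling fails.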

Open vs. secret specifications

Successful data conversion requires thorough knowledge of the workings of both source and target formats. In the case where the specification of a format is unknown, reverse engineering will be needed to carry out conversion. Reverse engineering can achieve close approximation of the original specifications, but errors and missing features can still result.

Electronics

Data format conversion can also occur at the physical layer of an electronic communication system. Conversion between line codes such as NRZ and RZ can be accomplished when necessary.
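
As a rough illustration of conversion between line codes, the sketch below models unipolar NRZ as one signal level per bit period and unipolar RZ as two half-period levels per bit, where a 1 is high for the first half and returns to zero for the second. This simplified model, the choice of the unipolar variants and the function names are all assumptions for the example; real line-code converters operate on electrical signals and timing, not Python lists.

```python
def nrz_to_rz(bits: list[int]) -> list[int]:
    """Map one level per bit (NRZ) to two half-period levels per bit (RZ)."""
    levels = []
    for bit in bits:
        if bit not in (0, 1):
            raise ValueError("bits must be 0 or 1")
        # First half carries the bit value; second half returns to zero.
        levels.extend([bit, 0])
    return levels


def rz_to_nrz(levels: list[int]) -> list[int]:
    """Recover the bits by sampling the first half of each bit period."""
    if len(levels) % 2 != 0:
        raise ValueError("RZ input must contain two levels per bit")
    return [levels[i] for i in range(0, len(levels), 2)]


bits = [1, 0, 1, 1]
rz = nrz_to_rz(bits)
print(rz)  # [1, 0, 0, 0, 1, 0, 1, 0]
assert rz_to_nrz(rz) == bits
```

The conversion is rule-based in both directions, so in this model no information is gained or lost; only the representation on the line changes.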

See also

Character encoding
Comparison of programming languages (basic instructions)#Data conversions
Data migration
Data transformation
Data wrangling
Transcoding
Distributed Data Management Architecture (DDM)
Code conversion (computing)
Source-to-source translation
Presentation layer


Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.