
Basis Technology


Company type: Private
Industry: Information technology, information access, digital forensics, transliteration
Founded: 1995
Headquarters: Somerville, Massachusetts, United States
Area served: Americas, Europe, Asia
Key people: Carl Hoffman (CEO, Co-Founder), Steven Cohen (EVP/COO, Co-Founder), Brian Carrier (CTO and GM Cyber Forensics), Simson Garfinkel (Chief Scientist), Junichi Hasegawa (VP Asia)
Products: KonaSearch, Cyber Triage, Autopsy, Sleuth Kit
Subsidiaries: BasisTech GK
Website: http://www.basistech.com, http://www.konasearch.com, http://www.autopsy.com, http://www.cybertriage.com

BasisTech is a software company specializing in applying artificial intelligence techniques to understanding documents and unstructured data written in different languages. It is headquartered in Somerville, Massachusetts, with a subsidiary office in Tokyo. Its legal name is BasisTech LLC.

The company was founded in 1995 by graduates of the Massachusetts Institute of Technology to use artificial intelligence techniques for natural language processing to help computer systems understand written human language. Its software focuses on analyzing freeform text so that applications can do a better job of understanding the meaning of the words. For example, the software can identify tokens, parts of speech, and lemmas. The tools can also identify different forms of names and phrases. A person's name, say Albert P. Jones, can appear in many different ways: some texts will call him "Al Jones", others "Mr. Jones", and others "Albert Paul Jones".

The software also performs entity extraction, that is, finding words in text that refer to people, places, and organizations, for uses such as due diligence, intelligence, and metadata tagging.

The company is best known for its Rosette product, which uses natural language processing techniques to improve information retrieval, text mining, search engines, and other applications. The tool is used to enable search engines to search in multiple languages and to match identities and dates. Rosette was sold to Babel Street in 2022.

BasisTech software is also used by forensic analysts to search through files for words, tokens, phrases, or numbers that may be important to investigators. The company also provides software (Cyber Triage) that helps organizations respond to cyberattacks.

Rosette

Rosette is available as a cloud deployment (public or on-premises) or as a Java SDK. It provides a variety of natural language processing tools for unstructured text: language identification, base linguistics, entity extraction, name matching, name translation, sentiment analysis, semantic similarity, relationship extraction, topic extraction, categorization, and Arabic chat translation. It can be integrated into applications to enhance financial compliance onboarding, communication surveillance compliance, social media monitoring, cyber threat intelligence, and customer feedback analysis.

The Rosette Linguistics Platform is composed of these modules:

Rosette Language Identifier looks at the structural and statistical signature of the file to identify the language. The pre-configured software can recognize 55 different languages with 45 different encodings.

Rosette Base Linguistics identifies the lemma or word stem after finding the tokens. Search is often faster and more accurate when words are grouped by their stem.

Rosette Entity Extractor analyzes raw text and identifies the probable role that words and phrases play in the document, a key step that makes it possible for algorithms to distinguish between the various meanings that many words can have. Splitting the raw text into groups of words according to their role and then classifying their contribution to meaning is often called entity analysis. The Basis hybrid approach mixes statistical modeling with rules, regular expressions, and gazetteers (lists of special words) that can be tuned to the language and text being analyzed. The tool is designed to work directly with varied alphabets and multiple languages, an advantage because foreign words are often transliterated in multiple ways. It is believed to be the first commercially available tool for analyzing Arabic text.

Rosette Name Translator transliterates non-Latin alphabets like Arabic into a consistent Latin form.

Rosette Name Indexer enables simple search across name variations, either by plugging into open source search engines or as a standalone service.

Rosette Core Library for Unicode smooths the use of Unicode text.

Rosette Chat Translator for Arabic converts words from the Arabic chat alphabet to Arabic.

Rosette is used both by United States government offices to support translation and by major Internet infrastructure firms such as search engines.

Digital forensics

BasisTech develops open-source digital forensics tools, The Sleuth Kit and Autopsy, to help identify and extract clues from data storage devices like hard disks or flash cards, as well as devices such as smart phones and iPods. The open-source licensing model allows them to be used as the foundation for larger projects, such as a Hadoop-based tool for massively parallel forensic analysis of very large data collections.

The digital forensics tool set is used to analyze file systems, new media types, new file types, and file system metadata. The tools can search for particular patterns in the files, allowing them to target significant files or usage profiles. They can, for instance, look for common files using hash functions and also deconstruct the data structures of important operating system log files.

The tools are designed to be customizable with an open plugin architecture. Basis Technology helps manage a large and diverse community of developers who use the tools in investigations.

KonaSearch

In June 2019, BasisTech acquired KonaSearch, a startup that specializes in search for Salesforce.com and other office database repositories and can automate the search step of business workflows.


Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.