ÿþ<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <title>IntuView Ltd.</title> <meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1" /> <link href="css/style.css" rel="stylesheet" type="text/css" /> <link href="css/dropdown/dropdown.css" media="all" rel="stylesheet" type="text/css" /> <link href="css/dropdown/dropdown.vertical.css" media="all" rel="stylesheet" type="text/css" /> <link href="css/dropdown/themes/default/default.ultimate.css" media="all" rel="stylesheet" type="text/css" /> </head> <body> <div id="wrapper"> <div id="wrapperi"> <div id="wrapperj"> <h1 id="header"><a href="default.aspx"><img src="images/int.jpg" alt="IntuView Ltd." /></a></h1> <div id="left1"> <br /><br /> <ul id="nav" class="dropdown dropdown-vertical"> <li><span class="dir">About Us</span> <ul> <li><a href="profile.html"><b>Company Overview</b></a></li> <li><a href="team.html"><b>Management Team</b></a></li> <li><a href="partners.html"><b>Our Partners</b></a></li> </ul> </li> <li><span class="dir">Products & Solutions</span> <ul> <li><a href="products.html"><b>Overview</b></a></li> <li><a href="intuscan-platform.html"><b>IntuScan"! Platform</b></a></li> <li><a href="ivstation.html"><b>IntuScan Station"!</b></a></li> <li><a href="bloginspector.html" class="blue"><b>IntuScan"! Blog Inspector</b></a></li> <li><a href="crawler.html" class="blue"><b>IntuScan"! Smart Crawler</b></a></li> <li><a href="intuscan-Name Matcher.html"><b>IntuScan Virtual Entity Creator and Name Matcher"!</b></a></li> <li><a href="IED.html"><b>IntuScan IED Recipe Identifier"!</b></a></li> <li><a href="ner.html"><b>IntuScan Named Entity Recognizer"!</b></a></li> <li><a href="restricted/content.aspx"><b>Knwoledge Base Packages</b></a></li> </ul> </li> <li><span class="dir">Services</span> <ul> <li><a href="restricted/services.aspx"><b>Customizations</b></a></li> <li><a href="restricted/training.aspx"><b>Training</b></a></li> <li><a href="download.html"><b>Download Center</b></a></li> </ul> </li> <li><a href="news.html"><b>News</b></a></li> <li><a href="careers.html"><b>Careers</b></a></li> <li><a href="contact.html"><b>Contact Us</b></a></li> </ul> <div class="clear"></div> <div class="clear"></div> <br /><br /> <font color="#006400" size="2"><b>Related Information</b></font> <div id="extensions"> <br /> <div id="bullets"> <ul> <li><a href="restricted/technology.aspx"> The IntuScan"! Technology</a></li> <li><a href="eval.aspx"> Request Evaluation</a></li> </ul> </div> <br /> <b>Technical Information</b> <table id="sidetable" > <tr> <td > <table> <tr> <td><b>Supported Transliteration Systems:</b></td> </tr> <tr> <td>IC, BGN, SATTS</td> </tr> <tr> <td><br /></td> </tr> <tr> <td><b>Supported Languages:</b></td> </tr> <tr> <td>Arabic. Planned to be supported soon - Indonesian/Malay, Farsi, Urdu, Pashtu.</td> </tr> <tr> <td><br />API is available for Java, C++ and .NET</td> </tr> </table> </td> </tr> </table> </div> <br /> </div> <div id="right1"> <div id="topnavigation"> <a href="default.aspx">Home </a> > NER </div> <div id="maincontenttitle"> IntuScan"! Named Entity Recognizer (NER) </div> <div id="maincontent"> IntuView has developed a propriety algorithm for Contextual Information Tagged Entity Identification. This technology takes advantage of IntuView's own language NLP (Natural Language Processing) algorithms to extract entities mentioned in the document and to determine whether they are: <br /><b>people, places, institutions or organizations, URLs and dates (Gregorian, Jewish, Farsi, Hijri).</b> <br /> <br /><br /> IntuView's algorithms do not only identify and link together entities which appear in the same form in the same document but identifies the same entity when it crops up in different documents in the same batch, and even in variant forms. <br /><br />For example, a person who is mentioned in a set of documents in different forms as follows: <br /><br /><br /> <div id="box"> <div id="bullets-narrow"> <ul> <li> Mujahid Sheikh Ahmad Yousuf Muhammad</li> <li> Sheikh Abu Yousuf , Leader of the Islamic Army </li> <li> Brother Mujahid Abu Yousuf Ahmad, MAY ALLAH PRESERVE HIM <br /> <font size="1">(document date 1.2.2009)</font> </li> <li> Our Leader, Sheikh Ahmad Yousuf Muhammad, MAY ALLAH HAVE COMPASSION ON HIM <br /><font size="1">(document date 1.4.2009, document identified as belonging to the Islamic Army)</font> </li> </ul> </div> <br /><br />Will all create the entity: <b>Ahmad Yousuf Muhammad</b>, described in the texts as Mujahid, Sheikh, Brother (in reference to the Islamic Army), Leader of the Islamic Army. His ideological orientation is jihadi-salafi and he is associated with the Islamic Army. He appears to have been alive on 1.2.2009 but deceased before 1.4.2009. <br /><br /><br /> </div> <br /><br /><br /> Similarly, the following names extracted from the document: <br /><br /><br /> <div id="box"> <div id="bullets-narrow"> <ul> <li>Sheikh 'Abdallah 'Azzam</li> <li>Brother Sheikh 'Abdallah 'Azzam, founder of the Jihadi Movement</li> <li>Sheikh Martyr 'Abdallah Yousuf 'Azzam, MAY ALLAH HAVE COMPASSION ON HIM</li> <li>Martyr Sheikh Abu Yousuf 'Abdallah MAY ALLAH HAVE COMPASSION ON HIM</li> </ul> </div> <br /><br /> All will be presented in the report as: <b>Abdallah Yousef 'Azzam</b>, AKA Abu Yousef 'Abdallah who is titled in the document as Sheikh, Brother, Sheikh Mujahid, Martyr Sheikh and is described as "the founder of the Jihadi movement". </div> <br /><br /><br /> The information gleaned from different occurrences of the person  even under variant names  is aggregated into an incrementally comprehensive picture of the person  his titles, his sect, his political orientation, etc. This "ad-hoc" person is added to the IntuScan knowledge base for further reference in other texts. <br /><br /><br /> <b>Automatic Transliteration</b><br /> Automatic transliteration of named entities in Arabic and other relevant languages, is another challenging task. The fact that Arabic names are written without diacritics, makes this task even more difficult. We use sophisticated rule based algorithm for dealing with different complicated cases. Beside the internal IntuView's transliteration system, IntuScan"! Named Entity Recognizer supports the following standard transliteration systems:<br /> <b>IC, BGN, SATTS</b>. <br /><br /><br /> </div> </div> <div class="clear"></div> </div> </div> <div class="clear"></div> <div id="footer"><div id="footeri"> <span class="copyright">© 2009 IntuView Ltd.</span> <a href="default.aspx">Home</a> &nbsp; <a href="restricted/services.aspx">Services</a> &nbsp; <a href="products.html">Products</a> &nbsp; <a href="contact.html">Contact</a> &nbsp; </div></div> </div> </div> </div> <script type="text/javascript"> var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www."); document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E")); </script> <script type="text/javascript"> try { var pageTracker = _gat._getTracker("UA-9565407-1"); pageTracker._trackPageview(); } catch(err) {} </script> </body> </html>