{"id":90,"date":"2020-12-03T09:19:10","date_gmt":"2020-12-03T09:19:10","guid":{"rendered":"http:\/\/dmslab.hkg03.bdysite.com\/?page_id=90"},"modified":"2023-01-17T15:29:39","modified_gmt":"2023-01-17T15:29:39","slug":"keyword-extraction","status":"publish","type":"page","link":"http:\/\/dmslab.net\/?page_id=90","title":{"rendered":"Keyword extraction"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\"><strong>Keyword extraction by entropy difference between the intrinsic and extrinsic mode<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">We propose a new metric to evaluate and rank the relevance of words in a text. The method uses the Shannon entropy difference between the intrinsic and extrinsic modes, which reflects the fact that relevant words carry the author\u2019s writing intention, i.e., their occurrences are modulated by the author\u2019s purpose, while irrelevant words are distributed randomly in the text. Using The Origin of Species by Charles Darwin as a representative text sample, we demonstrate the performance of our detector and compare it with previous proposals. Because no reference corpus (an author\u2019s collected writings, books, papers, etc.) is needed, 
our approach is especially suitable for single documents for which no a priori information is available.<\/p>\n\n\n\n<figure class=\"wp-block-video\"><video height=\"720\" style=\"aspect-ratio: 1280 \/ 720;\" width=\"1280\" controls src=\"http:\/\/dmslab.hkg03.bdysite.com\/wp-content\/uploads\/2021\/11\/\u5173\u952e\u8bcd\u63d0\u53d6\u5de5\u4f5c\u5f55\u5236.mp4\"><\/video><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Project Members<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\"><li>Zhen Yang\n<\/li><li>Weitong Chen\n<\/li><li>Hanchen Li\n<\/li><li>Chaoyang Li\n<\/li><li>Ning Lu\n<\/li><li>Longbo Zhang\n<\/li><li>Youjun E\n<\/li><\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Highlights<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\"><li>We propose a new metric to evaluate and rank the relevance of words in a text.\n<\/li><li>The metric uses the Shannon entropy difference between the intrinsic and extrinsic modes.\n<\/li><li>We believe that this work is a new result in keyword extraction and ranking.\n<\/li><li>Our approach is especially suitable for single documents for which no a priori information is available.\n<\/li><\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Publication<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\"><li><a href=\"http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0378437113004949\">[2013]\n Yang Z, Lei J, Fan K, Lai Y. 
\u201cKeyword Extraction by Entropy Difference \nBetween the Intrinsic and Extrinsic Mode.\u201d Physica A: Statistical \nMechanics and its Applications 392(19): 4523-4531.<\/a>\n<\/li><\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Code &amp; Toolbox<\/strong><\/h3>\n\n\n\n<ul class=\"wp-block-list\"><li><strong><a href=\"\/\/dms-research.awecoder.site\/\"><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-vivid-red-color\">Online Demo 1!<\/mark><\/a><\/strong><\/li><li><strong><a href=\"http:\/\/demo.yzlab.net:8090\">Online Demo 2!<\/a><\/strong> <\/li><li><a href=\"https:\/\/github.com\/fromskyblue\/Keywords-Extraction\">GitHub page<\/a> <\/li><li><a href=\"https:\/\/www.codeproject.com\/Articles\/643619\/Keyword-Extraction-Based-On-Entropy-Difference\">CodeProject page<\/a> <\/li><li><a href=\"http:\/\/ace.autotutor.org\/downloads\/sprout.1.0.0.0.zip\">SPROUT toolbox<\/a>, developed by <a href=\"https:\/\/yzlab.net\/zcai.autotutor.org\">Prof. Zhiqiang Cai<\/a>, which uses our algorithm to extract keywords from a target corpus and then uses those keywords to find extra articles on wiki to expand the corpus. <\/li><\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Update<\/strong><\/h3>\n\n\n\n<ol class=\"wp-block-list\"><li>Longbo Zhang. Keyword Detection Algorithm Based on Multi-Scale Partitioning. Master\u2019s thesis, Beijing University of Technology, 2014.<br>  Longbo Zhang. Keyword Detection System Based on Multi-Scale Partitioning. Computer software copyright (registration no. 2014SRBJ0226), 2014. 
<ul><li>Adds a multi-scale analysis method on top of the YANG\u201913 algorithm. <\/li><li>Partitions the text at multiple scales, jointly considers how each word is distributed at every granularity, and computes each word\u2019s topic relevance, thereby effectively detecting the keywords in the text. <\/li><li>Keyword detection on The Origin of Species improves markedly, reaching 100% accuracy for the top 19 keywords.   <\/li><\/ul><\/li><\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Keyword extraction by entropy difference between the in &hellip; <a href=\"http:\/\/dmslab.net\/?page_id=90\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">\u201cKeyword extraction\u201d<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":62,"menu_order":0,"comment_status":"open","ping_status":"closed","template":"","meta":{"footnotes":"","_wp_rev_ctl_limit":""},"class_list":["post-90","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"http:\/\/dmslab.net\/index.php?rest_route=\/wp\/v2\/pages\/90","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/dmslab.net\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/dmslab.net\/index.php?rest_route=\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/dmslab.net\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/dmslab.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=90"}],"version-history":[{"count":4,"href":"http:\/\/dmslab.net\/index.php?rest_route=\/wp\/v2\/pages\/90\/revisions"}],"predecessor-version":[{"id":2293,"href":"http:\/\/dmslab.net\/index.php?rest_route=\/wp\/v2\/pages\/9
0\/revisions\/2293"}],"up":[{"embeddable":true,"href":"http:\/\/dmslab.net\/index.php?rest_route=\/wp\/v2\/pages\/62"}],"wp:attachment":[{"href":"http:\/\/dmslab.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=90"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}