{"id":4122,"date":"2018-01-04T09:41:44","date_gmt":"2018-01-04T07:41:44","guid":{"rendered":"http:\/\/www.laurentmarot.fr\/wordpress\/?p=4122"},"modified":"2018-01-05T18:31:36","modified_gmt":"2018-01-05T16:31:36","slug":"playing-around-with-googl-n-gram","status":"publish","type":"post","link":"https:\/\/www.laurentmarot.fr\/wordpress\/?p=4122","title":{"rendered":"Playing around with Google n-gram"},"content":{"rendered":"<div id=\"attachment_4123\" style=\"width: 310px\" class=\"wp-caption alignleft\"><a href=\"http:\/\/www.laurentmarot.fr\/wordpress\/wp-content\/uploads\/2018\/01\/Capture-du-2018-01-04-08-36-11.png\" rel=\"lightbox[4122]\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-4123\" class=\"size-medium wp-image-4123\" src=\"http:\/\/www.laurentmarot.fr\/wordpress\/wp-content\/uploads\/2018\/01\/Capture-du-2018-01-04-08-36-11-300x133.png\" alt=\"exemple de recherche\" width=\"300\" height=\"133\" srcset=\"https:\/\/www.laurentmarot.fr\/wordpress\/wp-content\/uploads\/2018\/01\/Capture-du-2018-01-04-08-36-11-300x133.png 300w, https:\/\/www.laurentmarot.fr\/wordpress\/wp-content\/uploads\/2018\/01\/Capture-du-2018-01-04-08-36-11-768x341.png 768w, https:\/\/www.laurentmarot.fr\/wordpress\/wp-content\/uploads\/2018\/01\/Capture-du-2018-01-04-08-36-11-1024x455.png 1024w, https:\/\/www.laurentmarot.fr\/wordpress\/wp-content\/uploads\/2018\/01\/Capture-du-2018-01-04-08-36-11.png 1715w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><p id=\"caption-attachment-4123\" class=\"wp-caption-text\">exemple de recherche<\/p><\/div>\n<p><i><b><span class=\"lang-en\" lang=\"en\">Ngram Viewer<\/span><\/b><\/i> est une application linguistique propos\u00e9e par <a title=\"Google\" href=\"https:\/\/fr.wikipedia.org\/wiki\/Google\">Google<\/a>, permettant d\u2019observer l\u2019\u00e9volution de la fr\u00e9quence d\u2019un ou de plusieurs mots ou groupe de mots \u00e0 travers le temps dans les sources imprim\u00e9es (num\u00e9ris\u00e9es par Google).<\/p>\n<p>L\u2019outil est entr\u00e9 en service en 2010 et n&rsquo;a malheureusement plus \u00e9t\u00e9 mis \u00e0 jour depuis 2013.<\/p>\n<p>Le terme N<i><span class=\"lang-en\" lang=\"en\">gram<\/span><\/i> d\u00e9signe dans ce contexte une suite de \u00ab\u00a0n\u00a0\u00bb mots<sup id=\"cite_ref-1\" class=\"reference\"><a href=\"https:\/\/fr.wikipedia.org\/wiki\/Ngram_Viewer#cite_note-1\">1<\/a><\/sup>, ce qui n&rsquo;est li\u00e9 que faiblement \u00e0 la notion de <a title=\"N-gramme\" href=\"https:\/\/fr.wikipedia.org\/wiki\/N-gramme\">n-gramme<\/a>.<\/p>\n<p>L\u2019outil Ngram de Google repose sur la base de donn\u00e9es textuelles de <a title=\"Google Livres\" href=\"https:\/\/fr.wikipedia.org\/wiki\/Google_Livres\">Google Livres<\/a>. Les textes issus de Google Livres sont class\u00e9s en fr\u00e9quence de s\u00e9quences de mots (appel\u00e9es <i>ngrams<\/i>) par ann\u00e9e d\u2019\u00e9dition, chaque s\u00e9quence de mots est alors affect\u00e9e d\u2019un \u00ab\u00a0poids\u00a0\u00bb.<\/p>\n<p>Lorsque l&rsquo;utilisateur demande une comparaison de plusieurs <i>s\u00e9quences de mots<\/i>, l&rsquo;outil trace alors des courbes permettant de comparer leur fr\u00e9quence d&rsquo;usage au cours du temps.<\/p>\n<p>Un exemple de recherche via ce <a href=\"https:\/\/books.google.com\/ngrams\/graph?content=European+Union%2CPalestinian+Authority%2CNorth+Atlantic+Treaty+Organization&amp;case_insensitive=on&amp;year_start=1920&amp;year_end=2000&amp;corpus=15&amp;smoothing=3&amp;share=&amp;direct_url=t4%3B%2CEuropean%20Union%3B%2Cc0%3B%2Cs0%3B%3BEuropean%20Union%3B%2Cc0%3B%3BEuropean%20union%3B%2Cc0%3B%3BEUROPEAN%20UNION%3B%2Cc0%3B.t4%3B%2CPalestinian%20Authority%3B%2Cc0%3B%2Cs0%3B%3BPalestinian%20Authority%3B%2Cc0%3B%3BPalestinian%20authority%3B%2Cc0%3B.t4%3B%2CNorth%20Atlantic%20Treaty%20Organization%3B%2Cc0%3B%2Cs0%3B%3BNorth%20Atlantic%20Treaty%20Organization%3B%2Cc0%3B%3BNORTH%20ATLANTIC%20TREATY%20ORGANIZATION%3B%2Cc0\">lien<\/a> et pour les amoureux du Google Search, le d\u00e9tail d&rsquo;une requ\u00eate constitutive :<\/p>\n<p>https:\/\/www.google.fr\/search?<\/p>\n<p>q=%22european+union%22&amp;<\/p>\n<p>tbm=bks&amp;<\/p>\n<p>tbs=cdr:1,cd_min:2000,cd_max:2000&amp;<\/p>\n<p>lr=lang_en<\/p>\n<p>&amp;gws_rd=cr<\/p>\n<p>&amp;dcr=0<\/p>\n<p>&amp;ei=QtpNWrHdB8zUkwWtoKGIBg<\/p>\n<p>&nbsp;<\/p>\n\n","protected":false},"excerpt":{"rendered":"<p>Ngram Viewer est une application linguistique propos\u00e9e par Google, permettant d\u2019observer l\u2019\u00e9volution de la fr\u00e9quence d\u2019un ou de plusieurs mots ou groupe de mots \u00e0 travers le temps dans les sources imprim\u00e9es (num\u00e9ris\u00e9es par Google). L\u2019outil est entr\u00e9 en service en 2010 et n&rsquo;a malheureusement plus \u00e9t\u00e9 mis \u00e0 jour depuis 2013. Le terme Ngram [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[39],"tags":[],"_links":{"self":[{"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/4122"}],"collection":[{"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=4122"}],"version-history":[{"count":5,"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/4122\/revisions"}],"predecessor-version":[{"id":4129,"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/4122\/revisions\/4129"}],"wp:attachment":[{"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=4122"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=4122"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.laurentmarot.fr\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=4122"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}