{"id":27,"date":"2017-05-20T16:31:35","date_gmt":"2017-05-20T16:31:35","guid":{"rendered":"https:\/\/www.ryanboyd.io\/software\/tapa\/?page_id=27"},"modified":"2020-09-05T06:58:17","modified_gmt":"2020-09-05T06:58:17","slug":"dictionaries","status":"publish","type":"page","link":"https:\/\/www.ryanboyd.io\/software\/tapa\/dictionaries\/","title":{"rendered":"Dictionaries"},"content":{"rendered":"<p>Several dictionaries exist that can be loaded into TAPA. If you have your own dictionary that you would like to make available to others, please <a href=\"m&#97;&#105;&#x6c;&#x74;&#x6f;:r&#121;&#97;&#x6e;&#x62;&#x6f;y&#100;&#64;&#117;&#x74;&#x65;&#x78;a&#115;&#46;&#x65;&#x64;&#x75;?subject=TAPA\">send me an e-mail<\/a> to have it uploaded to this site.<\/p>\n<p>A couple of brief notes about making your own dictionary file:<\/p>\n<ul>\n<li>The first column on the left should always have the header &#8220;Symbol&#8221;<\/li>\n<li>The second column should always have the header &#8220;Type&#8221;\n<ul>\n<li>The &#8220;Type&#8221; column can be filled with one of two values, corresponding to the type of symbol that is being coded (words or characters). The symbol types are as follows for words and characters, respectively:\n<ul>\n<li>word<\/li>\n<li>char<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>If you have any questions about making a dictionary file, please refer to the files that are available below for reference. Additionally, feel free to send me an e-mail if I can be of any assistance.<\/p>\n<hr \/>\n<p><strong>*Note<\/strong>: The Warriner et al. ratings found below are the same that are loaded into the software by default.<\/p>\n<p><strong>**Note:<\/strong>\u00a0For all of the dictionary files on this page, you may want to set TAPA&#8217;s &#8220;Text Encoding&#8221; option to <strong>utf-8<\/strong> when reading the files in. Your system&#8217;s default encoding may also be fine.<\/p>\n<hr \/>\n<p><span style=\"text-decoration: underline;\">Dictionary Files <\/span><\/p>\n<p><span style=\"color: #ff0000;\">Right Click and &#8220;Save Link As&#8230;&#8221; to download a dictionary\u00a0file. Alternatively, click the &#8220;download&#8221; link, then copy and paste the dictionary contents into a .txt file on your hard drive.<\/span><\/p>\n<p style=\"padding-left: 30px;\"><strong>Bestgen &amp; Vincze (2012)<\/strong> &#8211; DIC-LSA norms (<a href=\"https:\/\/link.springer.com\/article\/10.3758\/s13428-012-0195-z\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Norms_DIC-LSA.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Buchanan et al. (2012)\u00a0<\/strong>&#8211; Single word norms (<a href=\"http:\/\/wordnorms.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Buchanan%20et%20al.%20Single%20Word%20Norms.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Brysbaert, Warriner, &amp; Kuperman (2014)<\/strong> &#8211; Concreteness norms (<a href=\"http:\/\/crr.ugent.be\/archives\/1330\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Brysbaert%20et%20al%20Concreteness%20Norms.txt\">download<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Brysbaert%20et%20al%20Concreteness%20Norms%20-%20Rescaled.txt\">rescaled version<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Brysbaert et al. (2014) &#8211;\u00a0<\/strong>Concreteness norms for Dutch (<a href=\"http:\/\/crr.ugent.be\/archives\/1602\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Brysbaert%20et%20al%20Dutch%20Concreteness.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong><span class=\"authors__name\">Engelthaler &amp;\u00a0<\/span>Hills (2017)<\/strong> &#8211; Humor norms (<a href=\"https:\/\/link.springer.com\/article\/10.3758\/s13428-017-0930-6\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/HumorNorms.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Kov\u00e1cs, Carroll, &amp; Lehman (2013)\u00a0&#8211;\u00a0<\/strong>Authenticity norms (<a href=\"http:\/\/pubsonline.informs.org\/doi\/pdf\/10.1287\/orsc.2013.0843\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Authenticity.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Kuperman et al. (2012)<\/strong> &#8211; Age of Acquisition norms (<a href=\"http:\/\/crr.ugent.be\/archives\/806\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Kuperman%20et%20al%20AoA%20Norms.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Kwan et al. &#8211;\u00a0<\/strong>Embodiment ratings for 687 English verbs (<a href=\"http:\/\/psychology.ucalgary.ca\/languageprocessing\/node\/22\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/kwan_et_al_verb_embodiment_ratings.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Lynott &amp; Connell (2009)<\/strong> &#8211; Adjective Modality norms (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/19363198\/\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Lynott-BRM-2009%20Modality%20Norms.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Lynott &amp; Connell (2013)<\/strong> &#8211; Noun Modality norms (<a href=\"https:\/\/link.springer.com\/article\/10.3758%2Fs13428-012-0267-0\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Lynott%20%26%20Connell%202013%20Noun%20Modality%20Norms.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Lynott &amp; Connell<\/strong> &#8211; Adjective and Noun Modality norms combined (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Lynott%20Adjective%20and%20Noun%20Modality%20Norms%20Combined.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Paetzold &amp; Specia (2016) &#8211;\u00a0<\/strong>Psycholinguistic Properties of Words (<a href=\"http:\/\/www.aclweb.org\/anthology\/N16-1050\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Bootstrapped_Psycholinguistic_Features.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Stadthagen-Gonzalez et al. (2016)<\/strong>\u00a0&#8211; Spanish Emotional Norms (<a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed\/26850056\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Stadthagen%20et%20al%20Spanish%20Emotional%20Norms.txt\">download<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Warriner et al. (2013)<\/strong> &#8211; Affective rating norms (<a href=\"http:\/\/crr.ugent.be\/archives\/1003\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>) (<a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/_dictionaries\/Ratings_Warriner_et_al.txt\">download<\/a>)<\/p>\n<hr \/>\n<h3><span style=\"text-decoration: underline;\">Pre-trained\u00a0<\/span><span style=\"text-decoration: underline;\">Word Vectors<\/span><\/h3>\n<p><strong><span style=\"color: #ff0000;\">Note<\/span><span style=\"color: #ff0000;\">: These dictionaries consist of extremely large numbers of words and dimensions. All of the following files are extremely large and are <span style=\"text-decoration: underline;\">extremely memory-intensive<\/span>. You will most likely need at least 32GB of RAM in your computer to use any of the following files. Some may require 64+ GB of RAM.<\/span><\/strong><\/p>\n<p>You may also consider downloading the &#8220;first 100K&#8221; or &#8220;first 500k&#8221; versions of these dictionary files. They are shortened versions of the full dictionaries and may be more viable on less cutting-edge systems.<\/p>\n<p style=\"padding-left: 30px;\"><strong>Pennington et al. (2014)<\/strong>\u00a0&#8211; GloVe pre-trained vectors (<a href=\"https:\/\/nlp.stanford.edu\/projects\/glove\/\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>)<\/p>\n<p style=\"padding-left: 60px;\">\u21d2 Wikipedia 2014 + Gigaword 5, 100-dimensional version, de-duplicated (<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-glove.6B.100d-deduped.txt.zip\">download<\/a>)<\/p>\n<p style=\"padding-left: 60px;\">\u21d2 Twitter, 25 through 200 dimension versions, cleaned (<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-glove.twitter.27B-clean.zip\">download<\/a>)\u00a0(<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-glove.twitter.27B-clean_first_500K.zip\">first 500K words version<\/a>) (<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-glove.twitter.27B-clean_first_100K.zip\">first 100K words version<\/a>)<\/p>\n<p style=\"padding-left: 60px;\">\u21d2 Common Crawl, 42B token version, 300 dimensions, uncased, cleaned<br \/>\n(<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-glove.42B.300d-uncased.zip\">download<\/a>)\u00a0(<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-glove.42B.300d-uncased_first_500K.zip\">first 500K words version<\/a>)\u00a0(<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-glove.42B.300d-uncased_first_100K.zip\">first 100K words version<\/a>)<\/p>\n<p style=\"padding-left: 30px;\"><strong>Salle (2016) &#8211;<\/strong>LexVec pre-trained vector (<a href=\"https:\/\/github.com\/alexandres\/lexvec\" target=\"_blank\" rel=\"noopener noreferrer\">link<\/a>)<\/p>\n<p style=\"padding-left: 60px;\">\u21d2 Common Crawl, 58B tokens, 300 dimensions, word vectors (<a href=\"https:\/\/www.pancakes.wtf\/tapa\/TAPA-lexvec.commoncrawl.300d.W.pos.neg3.vectors.7z\">download<\/a>)<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Several dictionaries exist that can be loaded into TAPA. If you have your own dictionary that you would like to make available to others, please send me an e-mail to have it uploaded to this site. A couple of brief notes about making your own dictionary file: The first column on the left should always &hellip; <a href=\"https:\/\/www.ryanboyd.io\/software\/tapa\/dictionaries\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Dictionaries&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":2,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0},"_links":{"self":[{"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/pages\/27"}],"collection":[{"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/comments?post=27"}],"version-history":[{"count":44,"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/pages\/27\/revisions"}],"predecessor-version":[{"id":123,"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/pages\/27\/revisions\/123"}],"wp:attachment":[{"href":"https:\/\/www.ryanboyd.io\/software\/tapa\/wp-json\/wp\/v2\/media?parent=27"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}