|
1 year ago | |
---|---|---|
slides | 1 year ago | |
visuels | 1 year ago | |
.gitignore | 1 year ago | |
Makefile | 1 year ago | |
NLP-TP3.bib | 1 year ago | |
README.md | 1 year ago | |
analyse_articles.ipynb | 1 year ago | |
commentaires.ipynb | 1 year ago | |
commentaires_reseaux_sociaux.mm | 1 year ago | |
parsing_functions.py | 1 year ago | |
pretraitement.py | 1 year ago | |
rapport.md | 1 year ago | |
textes_articles.ipynb | 1 year ago | |
traitement_articles.ipynb | 1 year ago | |
traitement_commentaires.ipynb | 1 year ago | |
transactions-on-speech-and-language-processing.csl | 1 year ago | |
wn_pos_from_ud_pos.py | 1 year ago |
Cours: IFT-7022 Session: Automne 2019 Auteur: François Pelletier
Installer Anaconda3 depuis https://www.anaconda.com/distribution/#download-section
Installer ces paquets additionnels dans l’environnement
pip install newspaper3k
pip install emoji
pip install git+https://github.com/ClaudeCoulombe/FrenchLefffLemmatizer.git
import nltk
nltk.download('wordnet')
nltk.download('omw')
Télécharger CoreNLP depuis
https://stanfordnlp.github.io/CoreNLP/download.html
stanford-corenlp-full-2018-10-05.zip
stanford-french-corenlp-2018-10-05-models.jar
Exécuter cette commande dans le répertoire de CoreNLP
java -mx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -serverProperties StanfordCoreNLP-french.properties -preload tokenize,ssplit,pos,lemma,ner,parse,depparse -status_port 9000 -port 9000 -timeout 15000 &
make