The document summarizes a study that evaluated five popular natural language processing (NLP) toolkits (CoreNLP, NLTK, OpenNLP, SparkNLP, and spaCy) on their ability to perform common NLP tasks like sentence segmentation, tokenization, lemmatization, part-of-speech (POS) tagging, and named entity recognition (NER) on news articles in Malay. The study found that CoreNLP achieved the highest accuracy on four tasks, while spaCy was slightly better than CoreNLP for POS tagging. When retraining the NER models on Malaysian entities, CoreNLP and spaCy achieved the highest F-score of 0.78, beating OpenNLP.