Currently news items subject classification in Ethiopia is done manually by journalists which is time consuming task (although they are using computer system to store and dispatch information). This research experimented the application of machine learning techniques to automatic categorization of Amharic news items. Machine learning techniques, Naïve Bayes and k Nearest Neighbor classifiers, were used to categorize the Amharic news items. 11, 024 news articles were used to do this research. To come up with good results text preparation and per-processing was done. Stop-word and words that occur in 3 or less documents were removed from the collection. Thirty-three percent of the data was used for testing purposes. The result of this research indicated that such classifiers are applicable to automatically classify Amharic news items. However, the classifiers work well when the categories contain almost evenly distributed news items. The best result obtained is by the naïve Bayes. The result of this research is promising. Nevertheless, additional works are recommended in order to come up with good result.
Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.
Currently news items subject classification in Ethiopia is done manually by journalists which is time consuming task (although they are using computer system to store and dispatch information). This research experimented the application of machine learning techniques to automatic categorization of Amharic news items. Machine learning techniques, Naïve Bayes and k Nearest Neighbor classifiers, were used to categorize the Amharic news items. 11, 024 news articles were used to do this research. To come up with good results text preparation and per-processing was done. Stop-word and words that occur in 3 or less documents were removed from the collection. Thirty-three percent of the data was used for testing purposes. The result of this research indicated that such classifiers are applicable to automatically classify Amharic news items. However, the classifiers work well when the categories contain almost evenly distributed news items. The best result obtained is by the naïve Bayes. The result of this research is promising. Nevertheless, additional works are recommended in order to come up with good result.
The author has about 12 years of experience as an IT instructor/trainer, network administrator, and IT Manager in the private, government, and public sectors which enables him study, develop & maintain a system; train/support users; communicate with IT stakeholders; lead and motivate an IT team to achieve the ever increasing demands of users.
„Über diesen Titel“ kann sich auf eine andere Ausgabe dieses Titels beziehen.
Gratis für den Versand innerhalb von/der Deutschland
Versandziele, Kosten & DauerAnbieter: moluna, Greven, Deutschland
Zustand: New. Dieser Artikel ist ein Print on Demand Artikel und wird nach Ihrer Bestellung fuer Sie gedruckt. Autor/Autorin: Teklu SurafelThe author has about 12 years of experience as an IT instructor/trainer, network administrator, and IT Manager in the private, government, and public sectors which enables him study, develop & maintain a system train/su. Bestandsnummer des Verkäufers 5140449
Anzahl: Mehr als 20 verfügbar
Anbieter: Mispah books, Redhill, SURRE, Vereinigtes Königreich
paperback. Zustand: New. New. book. Bestandsnummer des Verkäufers ERICA82936592167046
Anzahl: 1 verfügbar