首页> 外文会议>International Conference on Informatics and Computing >Developing Indonesian corpus of pornography using simple NLP-text mining (NTM) approach to support government anti-pornography program
【24h】

Developing Indonesian corpus of pornography using simple NLP-text mining (NTM) approach to support government anti-pornography program

机译:使用简单的NLP文本挖掘(NTM)方法开发印度尼西亚色情语料库,以支持政府反色情计划

获取原文

摘要

The world of information technology and telecommunication advanced with the presence of the development the internet. With the emergence of the internet, pornography is easily obtained. Pornography in Indonesia is considered illegal because contrary to laws prevailing in Indonesia. Pornography also having the impact on bad at society under the age of one of them is how many children under age already have sexual intercourse. Approach done is taking the title from / content video pornography that is on a web page an (URL). Methods used namely semiautomatic where this method uses the method of manual and automatic and used K-Nearest Neighbor algorithm. K-Nearest Neighbor algorithm is one the algorithm can be utilized for implementation classification. With K-Nearest Neighbor algorithm can classify data that belongs to a porno or not. Tools used is web corp and programming PHP language. The manufacture of limitation from corpus this is build corpus pornography Indonesian language and data taken to object research site is 10 pornography who can access via smartphone.
机译:信息技术和电信世界通过发展互联网发展。随着互联网的出现,色情很容易获得。印度尼西亚的色情被认为是非法的,因为与印度尼西亚普遍存在的法律相反。在其中一个人的社会下,色情片也对社会的影响是有多少年龄在年龄的儿童已经发生性交。完成方法是从/内容视频色情内提取的标题,这些色情在网页AN(URL)上。方法使用手动和自动和使用的k最近邻算法的方法的半自动。 k-collect邻算法是算法可以用于实现分类的算法。 k-inceltberank algorithm可以对属于Porno的数据进行分类。使用的工具是Web Corp和编程PHP语言。从语料库的制造这是构建语料库色情印度尼西亚语言和对象研究网站的数据是10个可以通过智能手机访问的色情内容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号