【24h】

Bayan: an Arabic text database management system

机译:巴彦:阿拉伯文本数据库管理系统

获取原文

摘要

Most existing databases lack features which allow for the convenient manipulation of text. It is even more difficult to use them if the text language is not based on the Roman alphabet. The Arabic language is a very good example of this case. Many projects have attempted to use conventional database systems for Arabic data manipulation (including text data), but because of Arabic's many differences with English, these projects have met with limited success. In the Bayan project, the approach has been different. Instead of simply trying to adopt an environment to Arabic, the properties of the Arabic language were the starting point and everything was designed to meet the needs of Arabic, thus avoiding the shortcomings of other projects. A text database management system was designed to overcome the shortcomings of conventional database management systems in manipulating text data. Bayan's data model is based on an object-oriented approach which helps the extensibility of the system for future use. In Bayan, we designed the database with the Arabic text properties in mind. We designed it to support the way Arabic words are derived, classified, and constructed. Furthermore, linguistic algorithms (for word generation and morphological decomposition of words) were designed, leading to a formalization of rules of Arabic language writing and sentence construction. A user interface was designed on top of this environment. A new representation of the Arabic characters was designed, a complete Arabic keyboard layout was created, and a window-based Arabic user interface was also designed.

机译:

大多数现有数据库缺少允许方便地操作文本的功能。如果文本语言不是基于罗马字母,则使用它们会更加困难。阿拉伯语是这种情况的一个很好的例子。许多项目尝试使用常规的数据库系统来处理阿拉伯数据(包括文本数据),但是由于阿拉伯语与英语的众多差异,这些项目取得的成功有限。在Bayan项目中,方法有所不同。阿拉伯语言的属性不是开始尝试采用阿拉伯语的环境,而是起点,并且所有内容都旨在满足阿拉伯语的需求,从而避免了其他项目的缺点。设计了文本数据库管理系统,以克服常规数据库管理系统在处理文本数据方面的缺点。 Bayan的数据模型基于一种面向对象的方法,该方法有助于系统的可扩展性以备将来使用。在Bayan,我们在设计数据库时考虑了阿拉伯文本属性。我们设计它来支持阿拉伯单词的派生,分类和构造方式。此外,设计了语言学算法(用于单词生成和单词形态分解),从而使阿拉伯语写作和句子构造规则正式化。用户界面是在此环境之上设计的。设计了一种新的阿拉伯字符表示形式,创建了完整的阿拉伯语键盘布局,还设计了基于窗口的阿拉伯语用户界面。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号