首页> 外国专利> A method and arrangement for automatically recognizing spoken dictionary

A method and arrangement for automatically recognizing spoken dictionary

机译:一种自动识别口语词典的方法和装置

摘要

1,021,226. Automatic speech recognition. INTERNATIONAL BUSINESS MACHINES CORPORATION. Nov. 9, 1962 [Nov. 14, 1961], No. 42414/62. Heading G4R. In a system for recognizing spoken words specific characteristics of speech are identified and there are means for establishing the time sequence of the characteristics in order to provide variable amplitude signals representative of the occurrence of particular spoken words. The characteristics identified are voicing and friction and a suitable circuit is shown in Fig. 2. The input signal, e.g. from a microphone, is applied to the bases of two transistors 20, 21 having their emitters connected to earth. The collectors are connected by resistors, via an R-C smoothing circuit to the input and via A.C. coupling capacitors 23 to a transistor 25. The circuit responds to the asymmetric signals which are produced by a spoken vowel to give a positive or negative signal at output 37 depending upon the vowel spoken. An R-C feedback loop 38 is adjustable to cause the circuit to respond with a positive signal to the vowel of the word " 3 " and with a negative signal to the word " 4." The transistor 25 amplifies the A.C. component of the input signal. This is differentiated at 26 and the zero-crossing pulses applied via diode 27 to an integrator circuit 29 which develops a positive voltage when friction frequency components are present. The amplified signals are further amplified by transistor 40 and again differentiated (at 42) and applied via diode 43 to integrator 44 which develops a negative voltage when friction is present. The integrators 29 and 44 are connected to opposite ends of a potentiometer the arrangement being such that for strong frictional sounds positive integrator 29 develops a strong charge and negative integrator 44 becomes saturated. The arm 31 of the potentiometer is placed so that the positive change of integrator 29 predominates. For weak friction the positive integrator receives a weak charge and the negative integrator, owing to the extra amplifier 40, receives a much greater charge so that it predominates. A positive output therefore indicates a strong frictional sound (e.g. " S ") and a negative output represents a weak frictional sound (" f "). The output from integrator 44 is also connected to one end of a potentiometer 34 connected also to the vowel output lead. An arm 35 gives an output representing voiced speech. The strong or weak friction signals, the voicing signals and signals distinguishing between particular sounds are combined in relay and diode gating circuits having weighted resistors adapted to give an output current representative of the word recognized. This may be registered on a meter 17, Fig. 5. Relays are used to determine the time-sequence of the voicing and friction signals. The relay 77 is operated by a voicing signal to switch contact 76 and draw current through meter 17 to - 12 V. source. Relays 85 and 86 also operate to switch contacts 91 so that if a friction signal Fw or Fs occurs after the voicing signal the relay Fwl or Fsl is operated, signifying " friction late " in the word. If the friction signal. precedes the voicing signal the other relays Fwe or Fse are operated, signifying " friction early." Contacts of these relays energize hold coils and draw predetermined currents through the meter so that the total amount of current indicates which of seven words has been recognized. In the circuit of Fig. 6 the input signals are subjected to a phase shift determined by the resistor 93 of R-C circuit 93, 94. A fixed R-C circuit 96, 97 follows and two unidirectional parallel lines 98, 99 and 100, 101 which serve to cut out signals near the base line. Capacitors 103, 104 are charged by the resulting peaks and give a positive or negative response according to the vowel and the adjustment of the circuit. Particular pairs of words, such as " three " and " four " or " two " and " seven " can be distinguished by suitably adjusted circuits. Plosives, e.g. " t " may be detected by the circuit of Fig. 11 in which a pair of networks 170, 171 constitute a low-pass filter (about 10 cycles) and the signal envelope which passes is amplified at 173 and 174. In Fig. 9 there are three circuits 141- 143 which indicate the presence of particular characteristics and circuits 144-146 distinguish between particular words. The signals are all applied to decision circuits 148 together with timing signals from a word time-base circuit 150 and timing control circuit 151. The former responds to voicing and frictional sound to mark the start of a new word. The circuit 151 controls the operations after a word has finished and it is designed to wait a given interval after the last occurrence of voicing or friction sounds. The output of the decision circuits passes to an adder-printer 152. The time-base circuit may again consist of relays adapted to complete different circuits according to their order of energization and the contacts of these relays may form the decision network giving an output on the appropriate one of sixteen leads (digits 0-9 and six control words). The timing circuit 151 has inputs from the three circuits 141-143 applied via diodes to a relay coil. When all three speech components cease the relay falls out and picks up another relay (sample) adapted to transfer the output to the adderprinter solenoids. When this second relay falls out a third relay energizes to give the reset pulse.
机译:1,021,226。自动语音识别。国际商业机器公司。 1962年11月9日[十一月。 14,1961],编号42414/62。标题G4R。在用于识别口语单词的系统中,识别语音的特定特征,并且存在用于建立这些特征的时间顺序的装置,以便提供代表特定口语单词的出现的可变幅度信号。识别出的特征是发声和摩擦,并且在图2中示出了合适的电路。来自麦克风的信号被施加到两个晶体管20、21的基极,两个晶体管的发射极接地。集电极通过电阻器,通过RC平滑电路连接到输入端,以及通过交流耦合电容器23连接到晶体管25。该电路响应由语音元音产生的非对称信号,在输出端37提供正或负信号取决于所讲的元音。 R-C反馈回路38是可调节的,以使电路以正信号响应单词“ 3”的元音,以负信号响应单词“ 4”。晶体管25放大输入信号的交流分量。这在26处被微分,并且过零脉冲通过二极管27施加到积分器电路29,该积分器电路29在存在摩擦频率分量时产生正电压。放大后的信号被晶体管40进一步放大并再次微分(在42处),并通过二极管43施加到积分器44,积分器44在存在摩擦时产生负电压。积分器29和44连接到电位计的相对端,该布置使得对于强摩擦声,正积分器29产生强电荷,而负积分器44变得饱和。放置电位计的臂31,使得积分器29的正向变化占主导。对于弱摩擦,由于额外的放大器40,正积分器接收到的电荷较弱,而负积分器接收到的电荷要大得多,因此它占主导地位。因此,正输出表示强烈的摩擦声(例如“ S”),而负输出表示弱的摩擦声(“ f”)。积分器44的输出也连接到电位计34的一端,该电位计也连接到元音输出引线。臂35给出代表有声语音的输出。强或弱摩擦信号,发声信号和区分特定声音的信号在继电器和二极管选通电路中组合在一起,该电路具有加权电阻,这些加权电阻适于提供代表所识别单词的输出电流。这可以在图5的仪表17上记录。继电器用于确定发声和摩擦信号的时间顺序。继电器77由声音信号操作,以切换触点76,并通过电表17将电流汲取至-12V电源。继电器85和86还操作来切换触点91,使得如果在发声信号之后出现摩擦信号Fw或Fs,则继电器Fwl或Fsl被操作,这在词中表示“迟摩擦”。如果有摩擦信号。在发声信号之前,其他继电器Fwe或Fse被操作,表示“提前摩擦”。这些继电器的触点为保持线圈通电,并通过电表吸取预定电流,从而总电流表明已识别出七个单词中的哪个。在图6的电路中,输入信号经受由RC电路93、94的电阻器93确定的相移。紧随其后的是固定的RC电路96、97和两条单向平行线98、99和100、101,切断基线附近的信号。电容器103、104由所产生的峰值充电,并根据元音和电路的调整给出正响应或负响应。特定的单词对,例如“三个”和“四个”或“两个”和“七个”可以通过适当调整的电路来区分。爆破药可以由图11的电路检测“ t”,其中一对网络170电路171、171构成低通滤波器(大约10个周期),并且通过的信号包络在173和174处被放大。在图9中,存在三个电路141-143,其指示特定特性的存在,并且电路144-146区分在特定词之间。这些信号全部与来自单词时基电路150和时序控制电路151的时序信号一起加到判决电路148。前者响应发声和摩擦声以标记新单词的开始。电路151在单词结束之后控制操作,并且它被设计成在最后一次发声或摩擦声之后等待给定的间隔。判定电路的输出传递到加法器-打印机152。时基电路可以再次由继电器组成,这些继电器根据其通电顺序适合于完成不同的电路,这些继电器的触点可以形成判定网络,在适当的十六个引线之一(数字0-9和六个控制字)。定时电路151具有来自三个电路141-143的输入,该三个电路141-143经由二极管施加到继电器线圈。当所有三个语音成分都停止时,继电器掉落并拾取另一个继电器(样本),该继电器适于将输出传输到加油机电磁阀。当第二个继电器掉电时,第三个继电器通电以提供复位脉冲。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号