会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 7. 发明申请
    • Action selection for reinforcement learning using influence diagrams
    • 使用影响图强化学习的行动选择
    • US20060224535A1
    • 2006-10-05
    • US11169503
    • 2005-06-29
    • David ChickeringTimothy PaekEric Horvitz
    • David ChickeringTimothy PaekEric Horvitz
    • G06F15/18
    • G06N99/005
    • A system and method for online reinforcement learning is provided. In particular, a method for performing the explore-vs.-exploit tradeoff is provided. Although the method is heuristic, it can be applied in a principled manner while simultaneously learning the parameters and/or structure of the model (e.g., Bayesian network model). The system includes a model which receives an input (e.g., from a user) and provides a probability distribution associated with uncertainty regarding parameters of the model to a decision engine. The decision engine can determine whether to exploit the information known to it or to explore to obtain additional information based, at least in part, upon the explore-vs.-exploit tradeoff (e.g., Thompson strategy). A reinforcement learning component can obtain additional information (e.g., feedback from a user) and update parameter(s) and/or the structure of the model. The system can be employed in scenarios in which an influence diagram is used to make repeated decisions and maximization of long-term expected utility is desired.
    • 提供了一种在线强化学习的系统和方法。 特别地,提供了用于执行探索与利用的权衡的方法。 尽管该方法是启发式的,但是它可以以原则的方式应用,同时学习模型的参数和/或结构(例如,贝叶斯网络模型)。 该系统包括接收输入(例如,来自用户)并且向决策引擎提供与关于模型的参数的不确定性相关联的概率分布的模型。 决策引擎可以确定是否利用已知的信息,或者至少部分地基于探索与利用权衡(Thompson策略)来探索获取附加信息。 强化学习组件可以获得附加信息(例如,来自用户的反馈)和更新参数和/或模型的结构。 该系统可用于使用影响图进行重复决策的场景,并期望实现长期预期效用的最大化。
    • 9. 发明申请
    • Speaker-dependent dialog adaptation
    • 与扬声器相关的对话框适应
    • US20060206333A1
    • 2006-09-14
    • US11170998
    • 2005-06-29
    • Timothy PaekDavid ChickeringEric Horvitz
    • Timothy PaekDavid ChickeringEric Horvitz
    • G10L13/00
    • G10L15/22G10L15/07
    • A simulation environment for adapting a speech model (e.g., baseline model) to a user is provided. The user can interact with a base parametric speech model (e.g., statistical model with learnable parameters such as a Bayesian network) and give positive and/or negative feedback when the dialog system has performed what the user considers to be appropriate and/or inappropriate action(s). From the user feedback, the dialog system learns to take actions customized for the particular user. Speaker-dependent adaptation can be extended to the dialog level by performing maximum likelihood linear regression (MLLR) adaptation simultaneously with dialog personalization. Users are immediately able to observe how their feedback has caused the dialog system to adapt, and can quit training whenever they feel that the dialog system has adapted enough for current purposes.
    • 提供了一种用于将语音模型(例如,基准模型)适配到用户的模拟环境。 用户可以与基本参数语音模型(例如,具有诸如贝叶斯网络的可学习参数的统计模型)交互,并且当对话系统执行用户认为是适当的和/或不适当的动作时给出正和负反馈 (s)。 从用户反馈中,对话系统学习采取针对特定用户定制的动作。 通过与对话个性化同时执行最大似然线性回归(MLLR)适应,可以将扬声器依赖的适应扩展到对话级。 用户可以立即观察他们的反馈如何使对话系统适应,并且只要他们觉得对话系统已经足够适应当前的目的,就可以退出训练。
    • 10. 发明申请
    • Using predictive user models for language modeling on a personal device
    • 在个人设备上使用预测用户模型进行语言建模
    • US20070239637A1
    • 2007-10-11
    • US11378024
    • 2006-03-17
    • Timothy PaekDavid Chickering
    • Timothy PaekDavid Chickering
    • G06F15/18
    • G06F17/276G06N99/005G10L15/183G10L15/22G10L2015/0631
    • A system and method for prediction of a user goal for command/control of a personal device (e.g., mobile phone) is provided. The system employs statistical model(s) that can predict a command based, at least in part, on past user behavior (e.g., probability distribution over a set of predicates, and, optionally arguments). Further, the system can be employed with a speech recognition component to facilitate language modeling for predicting the user goal. The system can include predictive user models (e.g., predicate model and argument model) that receive a user input (e.g., utterance) and employ statistical modeling to determine the likely command without regard to the actual content of the input (e.g., utterance). The system employs features for predicting the next user goal which can be stored in a user data store. Features can capture personal idiosyncrasies or systematic patterns of usage (e.g., device-related, time-related, predicate-related, contact-specific and/or periodic features).
    • 提供了一种用于预测用于个人设备(例如,移动电话)的命令/控制的用户目标的系统和方法。 该系统使用至少部分地基于过去的用户行为(例如,一组谓词上的概率分布,以及可选的参数)来预测命令的统计模型。 此外,该系统可以与语音识别组件一起使用以便于用于预测用户目标的语言建模。 该系统可以包括接收用户输入(例如,话语)并且采用统计建模来确定可能的命令而不考虑输入的实际内容(例如,话语)的预测用户模型(例如谓词模型和参数模型)。 该系统采用用于预测可存储在用户数据存储中的下一个用户目标的特征。 特征可以捕获个人特征或系统的使用模式(例如,与设备相关的,与时间相关的,谓词相关的,特定于接触的和/或周期的特征)。