:warning: 结帐分支以查看pyannote.audio 2.0 中的内容: 一个更小更干净的代码库 Python-first API(不过,旧的pyannote-audio CLI 仍然可用) 多 GPU 和 TPU 训练 使用数据增强 模特主持 音频注释的食谱 基于在线 使用pyannote-audio神经说话者分类 pyannote.audio是一个用 Python 编写的用于说话人分类的开源工具包。 基于机器学习框架,它提供了一组可训练的端到端神经构建块,可以组合和联合优化来构建说话人分类管道: pyannote.audio还带有涵盖了语音活动检测、说话人变化检测、重叠语音检测和说话人嵌入的广泛领域: 安装 pyannote.audio在 Linux 和 macOS 上仅支持 Python 3.7(或更高版本)。 它可能适用于 Windows,但没有任何保证,也没有任何计划添加对
1
Bishop - Pattern Recognition And Machine Learning - Springer 2006
2022-03-17 20:41:37 15.91MB Pattern Recognition Machine Learning
1
Pocketsphinx Python Pocketsphinx是语音识别开源工具包的一部分。 该软件包为使用和创建的CMU 和库提供了python接口。 支持平台 视窗 Linux Mac OS X 安装 # Make sure we have up-to-date versions of pip, setuptools and wheel python -m pip install --upgrade pip setuptools wheel pip install --upgrade pocketsphinx 提供更多用于手动安装的二进制发行版。 用法 现场语音 这是一个迭代器类,用于从麦克风进行连续识别或关键字搜索。 from pocketsphinx import LiveSpeech for phrase in LiveSpeech (): print ( phr
2022-03-16 20:12:37 34KB python voice speech speech-recognition
1
橙果识别的图像分割 为了识别橙色水果,使用了边缘检测和颜色检测方法。 对于这种方法,我使用了图像分割。 输入图像是在不同照明条件下捕获的桔子的图像,并使用图像分割来检测图像的颜色。 用Python完成的实现。 在该系统中,用户将输入橙色图像。 模型会将图像从RGB转换为灰度图像以进行进一步处理。
2022-03-16 12:17:33 109.63MB Python
1
The dramatic growth in practical applications for machine learning over the last ten years has been accompanied by many important developments in the underlying algorithms and techniques. For example, Bayesian methods have grown from a specialist niche to become mainstream, while graphical models have emerged as a general framework for describing and applying probabilistic techniques. The practical applicability of Bayesian methods has been greatly enhanced by the development of a range of approximate inference algorithms such as variational Bayes and expectation propagation, while new models based on kernels have had a significant impact on both algorithms and applications.   This completely new textbook reflects these recent developments while providing a comprehensive introduction to the fields of pattern recognition and machine learning. It is aimed at advanced undergraduates or first-year PhD students, as well as researchers and practitioners. No previous knowledge of pattern recognition or machine learning concepts is assumed. Familiarity with multivariate calculus and basic linear algebra is required, and some experience in the use of probabilities would be helpful though not essential as the book includes a self-contained introduction to basic probability theory.   The book is suitable for courses on machine learning, statistics, computer science, signal processing, computer vision, data mining, and bioinformatics. Extensive support is provided for course instructors, including more than 400 exercises, graded according to difficulty. Example solutions for a subset of the exercises are available from the book web site, while solutions for the remainder can be obtained by instructors from the publisher. The book is supported by a great deal of additional material, and the reader is encouraged to visit the book web site for the latest information.  
2022-03-16 00:08:22 6.63MB Pattern Recognition Machine
1
MNIST手写数字识别 概括 该项目的目的是编写一个程序,该程序可以识别和分类手写数字,并且其准确性要高于先前实现的99.15%。 为此,将训练卷积神经网络(CNN)模型并将其用于此任务。 包含70,000张图像的MNIST数据集将用于训练机器学习模型。 数据集中的每个图像都是大小为28x28像素的灰度图像。 最初,将数据集加载并存储到变量中。 然后将其分为训练数据和测试数据。 训练样本用于训练机器学习分类模型,而测试样本用于验证。 此外,将通过计算CNN模型的准确性得分来评估机器学习模型的性能。 背景技术卷积神经网络是一种由几层组成的机器学习模型。 对于分类和识别任务(包括手写数字识别)非常有用。 参考: : 挑战 由于我不熟悉OpenCV库,而且我对机器学习领域的了解有限,因此要在有限的时间内学习完成项目所需的所有概念将是一个挑战。 通过完成这个项目,我希望扩展我在机器学习概
2022-03-15 14:50:26 5KB Python
1
Speech_Emotion_Recognition
2022-03-14 16:14:48 20.41MB Python
1
使用指令的操作对SVM、MLP、LSTM算法进行了训练和测试的操作,采用CASIA数据集进行训练,需要安装python的语音处理报librosa,还有可以使用opensmile的部分,内容较为丰富
2022-03-14 00:09:48 73.04MB 语音情感识别 SVM LSTM MLP
1
Luxand Face SDK 4.0 破解版 人脸识别,面部特征识别,动态人类追踪等好用的功能
2022-03-11 20:34:07 32.5MB Face recognition
1
演讲演示 项目地址: : 文档地址: 语音识别: ://ai.baidu.com/docs#/ASR-API/top 语音合成: ://ai.baidu.com/docs#/TTS-API/top 百度语音rest api调用语音识别,语音合成示例 Rest API接口为http访问,任意操作系统,任意语言,只要能对baidu域名发起http请求的,均可以使用。 百度语音合成的接口支持跨域,但是获取令牌的接口不支持。浏览器直接发请求的示例见: : 简介 以JAVA PHP python C BASH作为示例,展示api的调用过程,选择一个运行即可。 目录结构 +--rest-api
2022-03-09 20:04:55 3.02MB rest-api speech-recognition baidu speech-to-text
1