基于开源URL数据字符串特征的恶意性检测项目源码+数据集+模型+项目说明.7z

上传者: DeepLearning_ | 上传时间: 2022-12-13 13:25:58 | 文件大小: 26.97MB | 文件类型: 7Z
基于开源URL数据字符串特征的恶意性检测项目源码+数据集+模型+项目说明.7z 从kdnuggets上收集到了带标签(good/bad)的URL数据集,共416350条,其中异常数据(bad)71556条,占比17.19%; 正常数据(good)344794条,占比82.81%。 将全体数据划分为训练集(70%),验证集(15%)和测试集(15%),并且在每个集合中均保持异常数据所占比例相同。 分类器模型 准确度(%) 精确度(%) 召回率(%) 贝叶斯 85.88 60.82 50.25 AdaBoost 92.84 86.05 69.65 随机森林 97.13 95.9 87.05 决策树 94.63 83.9 85.11 逻辑回归 90.86 83.29 58.58 梯度提升树 96.35 93.7 84.45 基于投票的分类器 97.1 92.51 90.48

文件下载

资源详情

[{"title":"( 24 个子文件 26.97MB ) 基于开源URL数据字符串特征的恶意性检测项目源码+数据集+模型+项目说明.7z","children":[{"title":"data","children":[{"title":"data.csv <span style='color:#111;'> 21.97MB </span>","children":null,"spread":false},{"title":"bad_urls.csv <span style='color:#111;'> 555.52KB </span>","children":null,"spread":false},{"title":"popular_web.txt <span style='color:#111;'> 3.93KB </span>","children":null,"spread":false},{"title":"fishtank+train.csv <span style='color:#111;'> 18.61MB </span>","children":null,"spread":false},{"title":"test_data","children":[{"title":"fishtank.csv <span style='color:#111;'> 2.85MB </span>","children":null,"spread":false}],"spread":true},{"title":"badwords.txt <span style='color:#111;'> 2.33KB </span>","children":null,"spread":false},{"title":"World_Top_500.csv <span style='color:#111;'> 6.33KB </span>","children":null,"spread":false},{"title":"splited_data","children":[{"title":"Cross Validation set.csv <span style='color:#111;'> 16.75MB </span>","children":null,"spread":false},{"title":"Training set.csv <span style='color:#111;'> 78.08MB </span>","children":null,"spread":false},{"title":"Test set.csv <span style='color:#111;'> 17.03MB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"pearson_correlation.py <span style='color:#111;'> 889B </span>","children":null,"spread":false},{"title":"single_test.py <span style='color:#111;'> 1.47KB </span>","children":null,"spread":false},{"title":"use_sklearn.py <span style='color:#111;'> 23.90KB </span>","children":null,"spread":false},{"title":"source.py <span style='color:#111;'> 2.63KB </span>","children":null,"spread":false},{"title":"badword_cloud.png <span style='color:#111;'> 55.12KB </span>","children":null,"spread":false},{"title":"whois_info.py <span style='color:#111;'> 208B </span>","children":null,"spread":false},{"title":"get_popular.py <span style='color:#111;'> 3.45KB </span>","children":null,"spread":false},{"title":"项目说明.md <span style='color:#111;'> 13.32KB </span>","children":null,"spread":false},{"title":"get_fishtank.py <span style='color:#111;'> 3.48KB </span>","children":null,"spread":false},{"title":"feature_extraction.py <span style='color:#111;'> 14.13KB </span>","children":null,"spread":false},{"title":"pearson.png <span style='color:#111;'> 655.38KB </span>","children":null,"spread":false},{"title":"bad_urls.py <span style='color:#111;'> 5.18KB </span>","children":null,"spread":false},{"title":"virus_total_check.py <span style='color:#111;'> 752B </span>","children":null,"spread":false},{"title":"data_split.py <span style='color:#111;'> 1.04KB </span>","children":null,"spread":false}],"spread":true}]

评论信息

其他资源

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明