国防科技大学
国防科技大学计算机学院
1007-130X
43-1258/TP
1973
计算机工程与科学
王志英
月刊
1-3个月
19216
42-153
¥796.00
0.9643
410073
随着教育信息化的发展,教育数据呈现特征数量高、冗余度高等特点,这使目前的分类算法在教育数据上分类准确率不理想。提出一种将特征权重算法与改进粒子群优化算法融合的混合式特征选择算法(RF-ATPSO)。该算法首先使用RELIEF-F算法计算各个特征的权重,筛除冗余特征,然后在筛选后的特征集合中利用改进粒子群算法搜索最优特征子集。实验结果表明,在6个UCI公共数据集上,经RF-ATPSO算法进行特征选择后,平均准确率提升了10.04%,且平均特征子集规模最小、收敛速度最快;在学生学业成绩画像特征数据集上,该算法以较小的特征子集规模达到较高的分类准确率,平均准确率为94.77%,明显优于其它特征选择算法,实验充分证明了该算法具有实际应用意义。
With the development of educational informatization, educational data presents characteristics such as high feature counts and high redundancy, resulting in the classification accuracy of current classification algorithms not being ideal on educational data. Therefore, this paper proposes a hybrid feature selection algorithm (RF-ATPSO) that integrates feature weighting algorithm with improved particle swarm optimization algorithm. The algorithm first uses the RELIEF-F algorithm to calculate the weights of each feature, removes redundant features, and then uses the improved particle swarm optimization algorithm to search for the optimal feature subset in the filtered feature set. Experimental results show that on 6 UCI public datasets, after feature selection using the RF-ATPSO algorithm, the average accuracy is improved by 10.04%, and the average feature subset size is the smallest and the convergence speed is the fastest. In the student academic performance portrait feature dataset, the algorithm achieves high classification accuracy with a smaller feature subset size, with an average accuracy of 94.77%, which is significantly better than other feature selection algorithms. The experiment fully demonstrates the practical application significance of this algorithm.
相关文章
[1] | 赵瑞平, 降爱莲. 基于自编码器和局部嵌入的无监督特征选择[J]. 计算机工程与科学, 2023, 45(07): 1282-1291. |
[2] | 文武, 万玉辉, 文志云, . 基于正余弦算法的文本特征选择[J]. 计算机工程与科学, 2022, 44(08): 1467-1473. |
[3] | 刘云, 肖添, 王梓宇. 动态特征选择算法对恶意行为检测的优化研究[J]. 计算机工程与科学, 2022, 44(04): 665-673. |
[4] | 苏小会, 张玉西, 徐淑萍, 尚煜. 改进K-means聚类算法行驶工况及油耗研究[J]. 计算机工程与科学, 2021, 43(11): 2020-2026. |
[5] | 张丽, 马静. 融合词语统计特征和语义信息的文本分类方法研究[J]. 计算机工程与科学, 2021, 43(07): 1308-1315. |
[6] | 张兰 雷秀娟. 几种改进PSO算法在带时间窗车辆路径问题中的比较与分析[J]. J4, 2008, 30(12): 55-59. |