数字图像处理与应用
分享
课程详情
课程评价
spContent=深刻理解图像处理原理,生动展现图像处理过程,灵活应用图像处理方法
—— 课程团队
课程概述

  图像和视频是人类记录、表达和传递外部世界的重要视觉载体,也是感知外部世界的视觉基础,图像处理是实现物联网、机器视觉和人工智能等相关应用的基本支撑技术。

  本课程将从如下八个方面来讲授图像处理的一些基本概念,方法与技术:

   1)图像表征(依据图像基信号是否基于数据驱动,介绍图像处理中经典的傅里叶变换,离散余弦变换和基于数据驱动的主元分析法,生动展现同一幅图像在不同变换下的形式)

   2)运动估计(分别介绍像素级别的光流法和图像块级别的块匹配算法原理,以及它们应用差异之所在)

   3)图像与视频压缩技术(分别介绍包括静态数字图像压缩标准JPEG和视频压缩MPEG原理)

   4)图像半色调技术(介绍包括最简单的阈值方法,用于印刷业的聚合型抖动模板和分散型抖动模板,误差传播法等)

   5)图像滤波技术(介绍图像中常见的噪声类型,传统图像滤波,如中值滤波和高斯滤波等,以及最近出现针对纹理的滤波方法)

   6)图像插值与超分辨率技术(介绍包括传统图像插值方法,和基于图像自相似图像超分辨率技术等)

   7)图像边缘检测与分割技术(介绍包括Canny算子,mean-shift图像色彩分割方法等)

   8)视频目标跟踪技术(介绍目前较热门的Discriminative Correlation Filter(DCF)的目标跟踪技术原理)。

在授课过程中我们通过理论与实践相结合方式,以及课后大量文献阅读来加深对图像处理基本概念和理论的理解;通过实例来分析比较不同图像处理方法的优缺点;通过提出问题来引导学生独立深入思考。

授课目标

课程的目标是通过学习,能让学员掌握图像处理与计算机视觉中一些基本概念,基本研究思路和方法等,从而帮助他们展开相关领域后续深入的研究工作,和开发相关应用系统等。

课程大纲
预备知识

线性代数,信号与系统,高等数学,统计概率论等。

证书要求

为积极响应国家低碳环保政策, 2021年秋季学期开始,中国大学MOOC平台将取消纸质版的认证证书,仅提供电子版的认证证书服务,证书申请方式和流程不变。

 

电子版认证证书支持查询验证,可通过扫描证书上的二维码进行有效性查询,或者访问 https://www.icourse163.org/verify,通过证书编号进行查询。学生可在“个人中心-证书-查看证书”页面自行下载、打印电子版认证证书。

 

完成课程教学内容学习和考核,成绩达到课程考核标准的学生(每门课程的考核标准不同,详见课程内的评分标准),具备申请认证证书资格,可在证书申请开放期间(以申请页面显示的时间为准),完成在线付费申请。

 

认证证书申请注意事项:

1. 根据国家相关法律法规要求,认证证书申请时要求进行实名认证,请保证所提交的实名认证信息真实完整有效。

2. 完成实名认证并支付后,系统将自动生成并发送电子版认证证书。电子版认证证书生成后不支持退费。


参考资料

[1]  C. Tomasi and R. Manduchi, “Bilateral filtering for gray and color images,” in Proc. of IEEE International Conference on Computer Vision (ICCV), 1998.

[2]   D. Glasner, S. Bagon, and M. Irani, “Super-resolution from a single image,” in Proc. of IEEE International Conference on Computer Vision (ICCV), pp.349-356, 2009.

[3]  P. Felzenszwalb and P. Huttenlocher, “Efficient belief propagation for early vision,” International Journal of Computer Vision, vol.70, no.1, 2006.

[4]  D. Comaniciu and P. Meer, “Mean shift: a robust approach toward feature space analysis,” IEEE Trans. On Pattern Analysis and Machine Intelligence, vol.24, no.5, pp. 603-619, 2002.

[5]  X. Lu, “Color textile image segmentation based multiscale probabilistic reasoning,” Optical Engineering, vol.46, no.8, 087002, 2007.

[6]    A. V. Oppenheim, A. S. Willsky and I. T. Young, Signals and Systems, Prentice-Hall, 1983.

[7]    M.J.T. Smith and A. Docef, A Study Guide for Digital Image Processing, Scientific Publishers, Inv. Riverdale, Georgia, 1999.

[8]    G. Johansson, “Visual perception of biological motion and a model for its analysis", Perception and Psychophysics, vol.14, 201-211, 1973.

[9]  B. Lucas and T. Kanade, “An iterative image registration technique with an application to stereo vision,” in Proc. of International Joint Conf. On Artificial Intelligence, pp.674-679, 1981.

[10]  B. Horn and B. Schunck, “Determining optical flow,” Artificial Intelligence, 17:185-203, 1981.

[11]   T. Koga, K. Iinuma, A. Hirano, Y. Iijima, and T. Ishiguro, “Motion compensated interframe coding for video conferencing,” Proceedings of national Telecommunications conference, New Orleans, LA, pp.G5.3.1–G5.3.5, Dec. 1981.

[12]   R. Li, B. Zeng, and M. L. Liou, “A new three-step algorithm for block motion estimation,” IEEE Trans. On Circuits and Systems for Video Technology, 4(4): 438-442, 1994.

[13]   L.-M. Po and W.-C. Ma, “A novel four-step search algorithm for fast block motion estimation,” IEEE Trans. On Circuits and Systems for Video Technology, 6(3): 313-317, 1996.

[14]   S. Zhu and K.-K. Ma, “A new diamond search algorithm for fast block-matching motion estimation,” IEEE Trans. On Image Processing, 9(2): 287-290, 2000.

[15]    G. K. Wallace "The JPEG still picture compression standard", Communications of the ACM, 34(4):30-44, April 1991.

[16]    Z. Wang, A.C. Bovik, H.R. Sheikh, and E. P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,” IEEE Trans. On Image Processing, 13(4):600-612, 2004.

[17]    B. E. Bayer, “An optimum method for two-level rendition of continuous-tone pictures,” in Proceedings of the IEEE International Conference on Communication, pp. 11-26, 1973.

[18]  R. Floyd and L. Steinberg, “An adaptive algorithm for spatial grey scale,” Society for Information Display Symposium, Digest of Technical Papers, pp.36-37, 1975.

[19]    X.-Q. Lu and H. Sakaino, “A spatial adaptive filter for smoothing of non-Gaussian texture noise” in Proc. of ICASSP, 2009.

[20]    L. Rudin, S. Osher and E. Fatemi, “Nonlinear total variation based noise removal,” Physical D, 60:259-268, 1992.

[21]    H. Cho, H. Lee, H. Kang, and S. Lee, ” Bilateral texture filtering,” ACM Transactions on Graphics, 33(4):1-8, 2014.

[22]    R. G. Keys, “Cubic convolution interpolation for digital image processing,” IEEE Trans. On Acoustics, Speech, and Signal Processing, ASSP-29(6): 1153-1160, 1981.

[23]    X. Li and M. T. Orchard, “New edge-directed interpolation,” IEEE Trans. On Image Processing, 10(10): 1521-1527, 2001.

[24]    J. Canny, “A Computational Approach to Edge Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, 8(6), pp. 679-698, 1986.

[25]    J.W. Cook and E.J. Delp, “Multiresolution sequential edge linking,” in Proc. IEEE Intl. Conf. Image Processing, pp.41-44, 1995.

[26]    D. Comaniciu and P. Meer, "Robust analysis of feature spaces: color image segmentation," in Proc. of IEEE CVPR1997.

[27]    D.S. Bolme, J.R. Beveridge, B.A. Draper, and Y.M. Lui, “Visual object tracking using adaptive correlation filters,” in Proc. of CVPR, 2010.

[28]  J.F. Henriques, R. Caseiro, P. Martins, and J. Batista, “High-speed tracking with kernelized correlation filter” IEEE Trans. On Pattern Analysis and Machine Intelligence, vol.37, no.3, pp.583-596, Mar. 2015.