Artificial Intelligence Laboratory


Resources Page for MIS 464 and MIS 611D

Class Resources for MIS 464, Data Analytics, and MIS 611D, Topics in Data and Web Mining (Spring 2019)

Instructor: Hsinchun Chen, Ph.D., Professor, Management Information Systems Dept, Eller College of Management, University of Arizona

TOPIC 1: Introduction (MIS, CS, BI, AI, Data Science)

  1. University of Arizona MIS Program: Overview, by Dr. Hsinchun Chen
  2. Business Analytics Intellectual Landscape, by Dr. Hsinchun Chen, 2020, pdf version
  3. MIS Analytics Curriculum, by Dr. Hsinchun Chen, 2020, pdf version
  4. History and Key Events of AI and Data Analytics, by Dr. Hsinchun Chen, 2020
  5. ACM Student Membership Application and Order Form [PDF], [Online Link]
  6. IEEE Student Membership Application
  7. Ideas for the Future of the IS Field, by G. B. Davis, P. Gray, S. Madnick, J. F. Nunamaker, R. Sprague, and A. Whinston. Transactions on Management Information Systems, Volume 1, Issue 1, pp. 2:1 - 2:15. [PDF copy here]
  8. Design Science, Grand Challenges, and Societal Impacts, by Hsinchun Chen. Transactions on Management Information Systems, Volume 2, Issue 1, pp. 1:1 - 1:10. [PDF copy here]
  9. Journals, Conferences, and Funding Sources for MIS Researchers and Educators: A Resource Guide, by Dr. Hsinchun Chen (Updated 2019)
  10. ISI Ranking of Top Computer Science and Information Systems Journals, from the ISI Web of Knowledge 2017 edition, updated in December 2018
  11. The H-Index for MIS, February 2019
  12. The H-Index for Computer Science, by Jens Palsberg, January 2020
  13. Template for Producing IT Research and Publication, by Dr. Hsinchun Chen
    1. IEEE Template - Word Document
    2. IEEE Paper Examples
      1. Exploring Threats and Vulnerabilities in Hacker Web: Forums, IRC and Carding Shops, by Benjamin et al., 2015, IEEE ISI
      2. Developing Understanding of Hacker Language through the use of Lexical Semantics, by Benjamin and Chen, 2015, IEEE ISI
      3. Detecting Cyber Threats in Non-English Dark Net Markets: A Cross-Lingual Transfer Learning Approach, by Ebrahimi et al., 2018, IEEE ISI
      4. Identifying, Collecting, and Presenting Hacker Community Data: Forums, IRC, Carding Shops, and DNMs, by Du et al., 2018, IEEE ISI
  14. Sample Research by Applications, by Dr. Hsinchun Chen
    1. Cybersecurity and Security Analytics
      1. Criminal Network Analysis and Visualization, by Jennifer Xu and Hsinchun Chen, 2005, CACM
      2. AI and Security Informatics, by Hsinchun Chen (September/October 2010), IEEE IS
      3. CyberGate: A Design Framework and System for Text Analysis of Computer-Mediated Communication, by Abbasi and Chen, December 2008, MISQ
      4. Detecting Fake Websites: The Contribution of Statistical Learning Theory, by Abbasi et al., September 2010, MISQ
      5. DICE-E: A Framework for Conducting Darknet Identification, Collection, Evaluation, with Ethics, by Victor Benjamin, Joseph S. Valacich, and Hsinchun Chen, 2019, MISQ
    2. Smart Health
      1. Smart Health and Wellbeing, by Hsinchun Chen (September/October 2011), IEEE IS
      2. AI for Global Disease Surveillance, by Hsinchun Chen and Daniel Zeng (November/December 2009), IEEE IS
      3. Time-To-Event Predictive Modeling for Chronic Conditions Using Electronic Health Records, by Yu-Kai Lin, Hsinchun Chen, Randall A. Brown, Shu-Hsing Li, and Hung-Jen Yang (2014), JBI
      4. Healthcare Predictive Analytics for Risk Profiling in Chronic Care: A Bayesian Multitask Learning Approach, by Yu-Kai Lin et al., 2017, MISQ
      5. Connecting Systems, Data, and People: A Multidisciplinary Research Roadmap for Chronic Disease Management, by Indranil Bardhan, Hsinchun Chen, and Elena Karahanna, 2020, MISQ
    3. Smart Business and Money
      1. Business and Market Intelligence 2.0, by Hsinchun Chen (January/February 2010), IEEE IS
      2. Smart Market and Money, by Hsinchun Chen (November/December 2011), IEEE IS
      3. AI and Opinion Mining, by Hsinchun Chen and David Zimbra (May/June 2010), IEEE IS
      4. Web Media and Stock Markets: A Survey and Future Directions from a Big Data Perspective, by Qing Li, Yan Chen, Jun Wang, Yuanzhu Chen, and Hsinchun Chen, 2018, IEEE TKDE
      5. A Multimodal Event-driven LSTM Model for Stock Prediction Using Online News, by Qing Li, Jinghua Tan, Jun Wang, and Hsinchun Chen, 2020, IEEE TKDE
    4. Sports and Games Analytics
      1. Expert Prediction, Symbolic Learning, and Neural Networks-An Experiment on Greyhound Racing, by Hsinchun Chen et al. (December 1994), IEEE Expert
      2. Sports Data Mining, by Robert Schumaker, Osama Solieman, and Hsinchun Chen, 2010, Springer
      3. AI, Virtual Worlds, and Massively Multiplayer Online Games, by Hsinchun Chen and Yulei Zhang (January/February 2011), IEEE IS
  15. Sample NSF Proposal for Cybersecurity, by Sagar Samtani, 2019
  16. Publishing in Major Journals & Getting Major Grants Consistently, by Dr. Hsinchun Chen, 2020
  17. Design Science in Information Systems Research, by Alan R. Hevner, Salvatore T. March, Jinsoo Park, and Sudha Ram. MIS Quarterly, Volume 28, Number 1, pp. 75-105, March 2004.
  18. Positioning and Presenting Design Science Research for Maximum Impact, by Shirley Gregor and Alan R. Hevner. MIS Quarterly, Volume 37, Number 2, pp. 337-355, June 2013.
  19. Editor's Comments: Diversity of Design Science Research, by Arun Rai, Andrew Burton-Jones, Hsinchun Chen, Alok Gupta, Alan R. Hevner, and Wolfgang Ketter. MIS Quarterly, Volume 41, Number 1, pp. iii-xviii, March 2017.
  20. MISQ BI Special Issue: Business Intelligence and Analytics: From Big Data to Big Impact, by Hsinchun Chen et al. (2012).
  21. UC Berkeley’s Fastest-Growing Class Is Data Science 101, by Douglas Belkin, WSJ, November 2, 2018
  22. The 50 Best Jobs in America for 2019 (Glassdoor Ranking), January 23, 2019.
  23. The State of Data Science and Machine Learning - Kaggle Survey 2017, 2017. [Interactive Online Version]
  24. Special Issue: BD2K Centers Open Doors to Discovery, Biomedical Computation Review, Summer 2017. [Online]
  25. CyberGate: A Design Framework and System for Text Analysis of Computer-Mediated Communication, by Abbasi and Chen, December 2008 (MISQ) - PDF
  26. CyberGate: A Design Framework and System for Text Analysis of CMC, by Abbasi and Chen, 2008 - PPT
  27. Detecting Fake Websites: The Contribution of Statistical Learning Theory, by Abbasi et al., September 2010 (MISQ) - PDF
  28. DICE-E: A Framework for Conducting Darknet Identification, Collection, Evaluation, with Ethics, by Victor Benjamin, Joseph S. Valacich, and Hsinchun Chen, 2019 (MISQ) - PDF
  29. Healthcare Predictive Analytics for Risk Profiling in Chronic Care: A Bayesian Multitask Learning Approach, by Yu-Kai Lin et al., 2017 (MISQ) - PDF
  30. Do Electronic Health Records Affect Quality of Care? Evidence from the HITECH Act, by Yu-Kai Lin et al., 2019 (ISR) - PDF
  31. His Promise to Heal Bad Hearts Relied on Mountain of False Data, by Gina Kolata, WSJ, October 30, 2018. [Online]
  32. Big Data Technology - Hadoop, MapReduce, and Spark (Jonathan Jiang, with updates from Sagar Samtani and Shuo Yu, 2019)
  33. Recalibrating global data center energy-use estimates, by Eric Masanet et al., 2020, Science
  34. Cloud Computing Is Not the Energy Hog That Had Been Feared, by Steve Lohr, 2020, New York Times
  35. Introduction to Blockchain and a Demo on Financial Trades (Eric Tham, 2018)
  36. Python Overview for Data Analytics
    1. Python for Data Analytics, by Mohammadreza Ebrahimi and Hsinchun Chen, 2020
    2. Introduction to Computation and Programming Using Python, by John Guttag, 2013
  37. Tableau Overview and Publicly Available Data Sources (Sagar Samtani and Hsinchun Chen, with updates from Hongyi Zhu, 2019)
    1. Sample NFL Dataset for Visualization
  38. Dark Web and Privacy Analytics Research: Hands-on Training and Planning, by Ebrahimi et al., 2020
  39. Smart Vulnerability Assessment for OS/VM, GitHub, IoT: An Overview, by Ullman et al., 2020

TOPIC 2: Web Mining (Surface Web, Deep Web, Social Web)

  1. Inside Internet Search Engines
    1. Fundamentals, by Jan Pedersen and William Chang (SIGIR 1999)
    2. Spidering and Indexing, by Jan Pedersen and William Chang (SIGIR 1999)
    3. Search, by Jan Pedersen and William Chang (SIGIR 1999)
    4. Products, by William Chang and Jan Pedersen (SIGIR 1999)
    5. Business, by William Chang and Jan Pedersen (SIGIR 1999)
  2. Search Engines and Their Algorithms, by C. Lee Giles (2018) (33M)
  3. The Anatomy of a Large-Scale Hypertextual Web Search Engine, S. Brin and L. Page (1998)
  4. Google Architecture and Technologies, by Hsinchun Chen (2020)
  5. Page Rank and Google Story, by Vise and Malseed, 2005
  6. AI, Chapter 4. Search Algorithm, Winston (1984)
  7. Search Algorithms with Examples, by Hsinchun Chen and Mohammadreza Ebrahimi (2020)
  8. GA Handout (27M)
  9. Network Science, by Sagar Samtani, Weifeng Li, and Hsinchun Chen, 2016
  10. The Great Giveaway (25M), by Erick Schonfeld, Business 2.0 (April 2005)
  11. The Long Tail, by Chris Anderson, WIRED Magazine (December 2004)
  12. Web 2.0 ... The Machine is Us/ing Us (YouTube)
  13. What Is Web 2.0? Design Patterns and Business Models for the Next Generation of Software, by Tim O'Reilly (2005)
  14. Web 2.0: Introduction, by Hsinchun Chen, 2009
  15. Facebook Story (2012)
  16. Communications of the ACM (2011):
    1. Reflecting on the DARPA Red Balloon Challenge, by John C. Tang et al. (April 2011)
    2. Crowdsourcing Systems on the World-Wide Web, by Anhai Doan et al. (April 2011)
    3. An Overview of Business Intelligence Technology, by Surajit Chaudhuri et al. (August 2011)
  17. World (Patent) War, from the Bloomberg Businessweek Technology section, March 12, 2012.
  18. The Netflix Recommender System: Algorithms, Business Value, and Innovation (Uribe and Hunt, 2015)
  19. Matrix Factorization Techniques for Recommender Systems (Koren, Bell, and Volinsky, 2009)
  20. Zillow awards $1 million to data scientists for improving its Zestimate algorithm, by Natalie Gagliordi (January 2019)
  21. Data Science and Prediction, by Vasant Dhar (2013)
  22. Harvard Business Review (October 2012)
    1. Big Data: The Management Revolution (from HBR 12/12)
    2. Data Scientist: The Sexiest Job of the 21st Century (from HBR 12/12)
    3. Making Advanced Analytics Work For You (from HBR 12/12)
  23. Hype Cycle for Business Intelligence, 2011, by Andreas Bitterer, Gartner Report (Aug. 12 2011)
  24. Magic Quadrant for Business Intelligence Platforms, by Rita L. Sallam et al., Gartner Report (Jan. 27 2011)
  25. The 2011 IBM Tech Trends Report, by IBM (Nov. 15th, 2011)
  26. The Economist, A Special Report on Social Networking: A World of Connections (January 30th 2010):
    1. A world of connections (from The Economist 1/30/10)
    2. Global swap shops (from The Economist 1/30/10)
    3. Twitter's transmitters (from The Economist 1/30/10)
    4. Profiting from friendship (from The Economist 1/30/10)
  27. The Economist, Data, Data, Everywhere: A Special Report on Managing Information (February 25th 2010); includes the following pieces:
    1. The data deluge
    2. Data, data everywhere
    3. All too much
    4. A different game
    5. Show me
    6. Needle in a haystack
    7. New rules for big data
    8. Clicking for gold
    9. Handling the cornucopia
    10. The open society
    11. Sources and acknowledgments
  28. The Economist, A Special Report on Personal Technology (October 8th 2011). Includes the following sections:
    1. Beyond the PC
    2. The Power of Many
    3. The Beauty of Bite-sized Software
    4. IT's Arab Spring
    5. Up Close
  29. The Economist, Special Report, Cyber-Security, July 12, 2014: Defending the Digital Frontier. Includes the following sections:
    1. Cybercrime: Hackers, Inc.
    2. Vulnerabilities: Zero-day game
    3. Business: Digital disease control
    4. Critical infrastructure: Crashing the system
    5. Market failures: Not my problem
    6. The Internet of Things: Home, hacked home
    7. Remedies: Prevention is better than cure
  30. The Economist, Technology Quarterly, Civilian Drones, June 10, 2017: Taking Flight. Includes the following sections:
    1. Give and take
    2. Seeing is believing
    3. Can drones deliver the goods?
    4. Rules and tools
  31. The Economist, Special Report, The Economics of Longevity, July 8, 2017: The New Old. Includes the following sections:
    1. Footloose and fancy-free
    2. Rock around the clock
    3. Don't call us silver
    4. Your money and your life
    5. Tablets for every problem
    6. A blessing, not a burden
  32. The Economist, September 9, 2017: Facial Industry. Includes the following sections:
    1. The facial-industry complex
    2. Keeping a straight face
    3. Making faces from DNA
  33. The Economist, Special Report, Autonomous Vehicles, March 3, 2018: Reinventing Wheels. Includes the following sections:
    1. From here to autonomy
    2. Selling rides, not cars
    3. The new autopia
    4. A different world
    5. Rules of the road
  34. The Economist, Special Report, AI in Business, March 31, 2018: GrAIt Expectations. Includes the following sections:
    1. In algorithms we trust
    2. Here to help
    3. Hire education
    4. Smile, you're on camera
    5. Leave it to the experts
    6. Two faced
  35. A Special Report on Artificial Intelligence. The New York Times, October 19, 2018. Includes the following articles:
    1. Workers Beware, by David Kaufman, October 18, 2018
    2. What Comes After the Roomba? by John Markoff, October 21, 2018
    3. The Computerized Chauffeur, by Norman Mayersohn, October 19, 2018
    4. A.I. Is Beginning to Assist Novelists, by David Streitfeld, October 18, 2018
    5. The A.I. Wave Is Here, by Steve Lohr, October 21, 2018
    6. Acknowledging the Pitfalls, Too, by Cade Metz, October 22, 2018
    7. Will There Be a Ban on Killer Robots? by Adam Satariano, October 19, 2018
    8. Breaking Big Tech's Hold on A.I., by Nathaniel Popper, October 20, 2018
  36. The Wall Street Journal's Recent Articles on 5G and Smart City, 2019. Includes the following articles:
    1. The Power of Combining 5G and AI, by James Rundle and Angus Loten, WSJ, Nov 8, 2019.
    2. The Good News About 5G Security, by Adam Janofsky, WSJ, Nov 10, 2019.
    3. 5G Race Could Leave Personal Privacy in the Dust, by Drew FitzGerald, WSJ, Nov 11, 2019.
    4. How Hackers Could Break Into the Smart City, by James Rundle, WSJ, Sep 17, 2019.
  37. IoT Device Security, by Steve Ullman, 2019.
  38. Cyber Threat Intelligence, by Sagar Samtani and Hsinchun Chen, 2019
  39. Looking to the Future of Cybersecurity, by Fang Yu Lin and Hsinchun Chen, 2019
  40. Computational Propaganda and Political Disinformation, by Zara AhmadPost and Steve Ullman, 2019.
  41. Introduction to Web Application and APIs (Revised by Jonathan Jiang and Julian Guo):
    1. Flickr Photo Search API Sample Code
    2. Amazon Product Advertising API Sample Code
    3. YouTube Data API Sample Code
    4. Yelp API Sample Code

TOPIC 3: Data Mining (Machine Learning, Deep Learning, AI)

  1. Predictive Analytics for Data Mining (Weifeng Li, Sagar Samtani, Hsinchun Chen, 2020)
  2. Publicly Available Data Sources (Sagar Samtani and Hsinchun Chen, with updates from Hongyi Zhu, 2019)
  3. Logistic Regression and Elastic Net (Weifeng Li, Hsinchun Chen, 2016)
  4. Pattern Recognition using Support Vector Machine: Text Classification and Cybersecurity (Ahmed Abbasi, Hsinchun Chen, 2020)
  5. Clustering for Data Mining: Overview and Examples (Dr. Hsinchun Chen, 2020)
  6. Neural Networks: Feedforward Backpropagation NN and Self-Organizing Map (Dr. Hsinchun Chen, 2020)
  7. Deep Learning: An Overview (Hsinchun Chen, 2020)
  8. An Introduction to Convolutional Neural Networks: Overview, Implementation, and Example (Shuo Yu and Hsinchun Chen, 2020)
  9. An Introduction to Recurrent Neural Networks: Overview, Implementation, and Application (Hongyi Zhu and Hsinchun Chen, 2020)
  10. WEKA Overview (Sagar Samtani, Weifeng Li, and Hsinchun Chen, with updates from Shuo Yu, 2019)
    1. iris-train, iris-test, houses-train, houses-test
  11. Top 10 Algorithms in Data Mining (PDF)
  12. ID3 Handout
  13. Backpropagation Neural Network Handout
  14. Self-organizing maps: an introduction
  15. K-means algorithm
  16. Expert Prediction, Symbolic Learning, and Neural Networks-An Experiment on Greyhound Racing, by Hsinchun Chen et al., IEEE Expert (December 1994)
  17. Introduction to Support Vector Machine (SVM) and Conditional Random Field (CRF) (Long Version, Short Version)
  18. Google masters Go (Nature, Elizabeth Gibney, January 28, 2016)
  19. Artificial Intelligence Go Showdown (The Economist, March 12, 2016)
  20. Artificial Intelligence - Million Dollar Babies - The Economist, April 2, 2016
  21. Mastering the game of Go with deep neural networks and tree search (Nature, Silver et al., 2016)
  22. The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation, by Miles Brundage et al., February 2018.
  23. Deep Learning (Nature, LeCun et al., 2015)
  24. Machine Learning: Trends, Perspectives, and Prospects (Science, Jordan and Mitchell, 2015)
  25. Editorial: Chess, a Drosophila of Reasoning (Science, Kasparov, 2018)
  26. One Giant Step for a Chess-Playing Machine (New York Times, Strogatz, 2018)
  27. A General Reinforcement Learning Algorithm That Masters Chess, Shogi, and Go Through Self-play (Science, Silver et al., 2018)
  28. Autoencoders: Overviews and Selected Application (Sagar Samtani and Hsinchun Chen, 2018)
  29. An Introduction to Deep Transfer Learning (Mohammadreza Ebrahimi and Hsinchun Chen, 2018)
  30. Deep Generative Models: An Overview (Yidong Chai, Weifeng Li, and Hsinchun Chen, 2018)
  31. Artificial Intelligence and Deep Learning (Lee Giles, 2018)
  32. Representation Learning (Alexander G. Ororbia II and Lee Giles, 2018)

TOPIC 4: Text Mining (Sentiment Analysis, Topic Modeling, Visualization)

  1. Text Mining: Techniques, Tools, Ontologies and Shared Tasks (Xiao Liu, Shuo Yu, Hsinchun Chen, 2020)
  2. An Overview of Topic Modeling (Weifeng Li and Hsinchun Chen, 2018)
  3. Topic Modeling and Latent Dirichlet Allocation: An Overview (Weifeng Li, Sagar Samtani, and Hsinchun Chen, 2016)
  4. Information Visualization
  5. Information Visualization for Digital Library (2.21M)
  6. Visualizing Data: Frameworks and Examples (Hongyi Zhu, Sagar Samtani, Hsinchun Chen, Spring 2019)

TOPIC 5: Emerging Research in Data and Web Mining (for MIS 611D)

  1. COPLINK, Dark Web, and Hacker Web: A Research Path in Security Informatics, by Dr. Hsinchun Chen
  2. Criminal Network Analysis and Visualization, by Jennifer Xu and Hsinchun Chen, 2005
  3. The Topology of Dark Networks, by Jennifer Xu and Hsinchun Chen
  4. Exploring Dark Networks: From the Surface Web to the Dark Web, by Hsinchun Chen, October 2017
  5. CyberGate: A Design Framework and System for Text Analysis of CMC, by Ahmed Abbasi and Hsinchun Chen
  6. MedTime: A Temporal Information Extraction System for Clinical Narratives, by Yu-Kai Lin, Hsinchun Chen and Randall A. Brown (2013)
  7. Smart and Connected Health: Guest Editors' Introduction, by Gondy Leroy, Hsinchun Chen, and Thomas C. Rindflesch (2014)
  8. Time-To-Event Predictive Modeling for Chronic Conditions Using Electronic Health Records, by Yu-Kai Lin, Hsinchun Chen, Randall A. Brown, Shu-Hsing Li, and Hung-Jen Yang (2014)
  9. Identifying Adverse Drug Events from Patient Social Media: A Case Study for Diabetes, by Xiao Liu and Hsinchun Chen (2015)
  10. HackerWeb and Shodan Access (Jonathan Jiang)
    1. Hacker Web Sample Code
    2. Shodan Sample Code
  11. Homeland Security Data Mining using Social (Dark) Network Analysis, ISI 2008, Keynote Address, by Dr. Chen (18.4M)
  12. Health Big Data Analytics: Clinical Decision Support and Patient Empowerment, by Dr. Hsinchun Chen
  13. IEEE Intelligent Systems, Trends & Controversies; with introductions by Dr. Hsinchun Chen (2009, 2010, 2011):
    1. AI and Global Science and Technology Assessment, by Hsinchun Chen (July/August 2009)
    2. AI, E-Government, and Politics 2.0, by Hsinchun Chen (September/October 2009)
    3. AI for Global Disease Surveillance, by Hsinchun Chen and Daniel Zeng (November/December 2009)
    4. Business and Market Intelligence 2.0, by Hsinchun Chen (January/February 2010)
    5. AI and Opinion Mining, by Hsinchun Chen and David Zimbra (May/June 2010)
    6. AI and Security Informatics, by Hsinchun Chen (September/October 2010)
    7. AI, Virtual Worlds, and Massively Multiplayer Online Games, by Hsinchun Chen and Yulei Zhang (January/February 2011)
    8. Smart Health and Wellbeing, by Hsinchun Chen (September/October 2011)
    9. Smart Market and Money, by Hsinchun Chen (November/December 2011)
  14. Recent Research at the Artificial Intelligence Lab of the University of Arizona: AZSecure for Advanced Cyber Threat Intelligence and SilverLink for Proactive Mobile Health, by Hsinchun Chen, March 2018.
  15. Cybersecurity and AI: A Data Science Perspective, by Hsinchun Chen, November 2018.
  16. Cyber Threat Intelligence (February 2019)
    1. Overview Fundamental
    2. Hacker Community Data
    3. Hacker Assets Portal
    4. Looking to the Future

MISCELLANEOUS RESOURCES

  Class Page for MIS 464, Data Analytics

  Class Page for MIS 611D, Topics in Data and Web Mining

  AI Lab Website

Photo provided through courtesy of DARPA and available through Wikimedia Commons.