Skip to main content

Machine Learning for Text

  • Textbook
  • © 2022
  • Latest edition

Overview

  • Integrates treatment of text mining/learning, information retrieval and natural language processing
  • Has a strong focus on deep learning, transformers and pre-trained language models
  • Simplifies the mathematical presentation with intuitive explanations
  • Request lecturer material: sn.pub/lecturer-material

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 69.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (17 chapters)

Keywords

About this book

This second edition textbook covers a coherently organized framework for text analytics, which integrates material drawn from the intersecting topics of information retrieval, machine learning, and natural language processing. Particular importance is placed on deep learning methods. The chapters of this book span three broad categories:1. Basic algorithms: Chapters 1 through 7 discuss the classical algorithms for text analytics such as preprocessing, similarity computation, topic modeling, matrix factorization, clustering, classification, regression, and ensemble analysis.

2. Domain-sensitive learning and information retrieval: Chapters 8 and 9 discuss learning models in heterogeneous settings such as a combination of text with multimedia or Web links. The problem of information retrieval and Web search is also discussed in the context of its relationship with ranking and machine learning methods. 

3. Natural language processing: Chapters 10 through 16 discuss various sequence-centric and natural language applications, such as feature engineering, neural language models, deep learning, transformers, pre-trained language models, text summarization, information extraction, knowledge graphs, question answering, opinion mining, text segmentation, and event detection. 

Compared to the first edition, this second edition textbook (which targets mostly advanced level students majoring in computer science and math) has substantially more material on deep learning and natural language processing. Significant focus is placed on topics like transformers, pre-trained language models, knowledge graphs, and question answering.



Authors and Affiliations

  • Mohegan Lake, USA

    Charu C. Aggarwal

About the author

Charu C. Aggarwal is a Distinguished Research Staff Member (DRSM) at the IBM T. J. Watson Research Center in Yorktown Heights, New York. He completed his undergraduate degree in Computer Science from the Indian Institute of Technology at Kanpur in 1993 and his Ph.D. in Operations Research from the Massachusetts Institute of Technology in 1996. He has published more than 400 papers in refereed conferences and journals, and has applied for or been granted more than 80 patents. He is author or editor of 20 books, including textbooks on linear algebra, machine learning (for text), neural networks, recommender systems, and outlier analysis. Because of the commercial value of his patents, he has thrice been designated a Master Inventor at IBM. He has received several internal and external awards, including the EDBT Test-of-Time Award (2014), the ACM SIGKDD Innovation Award (2019), and the IEEE ICDM Research Contributions Award (2015). He is also a recipient of the W. Wallace McDowell Award, which is the highest technical honor given by IEEE Computer Society in the field of computer science. He has served as an editor-in-chief of the ACM SIGKDD Explorations. He is currently serving as the editor-in-chief of the ACM Transactions on Knowledge Discovery from Data and as an editor-in-chief of ACM Books. He is a fellow of the SIAM, ACM, and the IEEE, for “contributions to knowledge discovery and data mining algorithms.”

Bibliographic Information

  • Book Title: Machine Learning for Text

  • Authors: Charu C. Aggarwal

  • DOI: https://doi.org/10.1007/978-3-030-96623-2

  • Publisher: Springer Cham

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: Springer Nature Switzerland AG 2022

  • Hardcover ISBN: 978-3-030-96622-5Published: 05 May 2022

  • Softcover ISBN: 978-3-030-96625-6Published: 06 May 2023

  • eBook ISBN: 978-3-030-96623-2Published: 04 May 2022

  • Edition Number: 2

  • Number of Pages: XXIII, 565

  • Number of Illustrations: 87 b/w illustrations, 5 illustrations in colour

  • Topics: Machine Learning, Data Mining and Knowledge Discovery, Information Storage and Retrieval

Publish with us