
Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
Paperback
Series: ACM Books
DatabasesSystem Administration
ISBN13: 9781970001167
Publisher: Morgan & Claypool
Published: Jun 30 2016
Pages: 530
Weight: 1.99
Height: 1.07 Width: 7.50 Depth: 9.25
Language: English
Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. This has led to an increasing demand for powerful software tools to help people analyze and manage vast amounts of text data effectively and efficiently. Unlike data generated by a computer system or sensors, text data are usually generated directly by humans, and are accompanied by semantically rich content. As such, text data are especially valuable for discovering knowledge about human opinions and preferences, in addition to many other kinds of knowledge that we encode in text. In contrast to structured data, which conform to well-defined schemas (thus are relatively easy for computers to handle), text has less explicit structure, requiring computer processing toward understanding of the content encoded in text. The current technology of natural language processing has not yet reached a point to enable a computer to precisely understand natural language text, but a wide range of statistical and heuristic approaches to analysis and management of text data have been developed over the past few decades. They are usually very robust and can be applied to analyze and manage text data in any natural language, and about any topic.
This book provides a systematic introduction to all these approaches, with an emphasis on covering the most useful knowledge and skills required to build a variety of practically useful text information systems. The focus is on text mining applications that can help users analyze patterns in text data to extract and reveal useful knowledge. Information retrieval systems, including search engines and recommender systems, are also covered as supporting technology for text mining applications. The book covers the major concepts, techniques, and ideas in text data mining and information retrieval from a practical viewpoint, and includes many hands-on exercises designed with a companion software toolkit (i.e., MeTA) to help readers learn how to apply techniques of text mining and information retrieval to real-world text data and how to experiment with and improve some of the algorithms for interesting application tasks. The book can be used as a textbook for a computer science undergraduate course or a reference book for practitioners working on relevant problems in analyzing and managing text data.
1 different editions
Also available
Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
Zhai, Chengxiang
Massung, Sean
Hardcover
Also in
System Administration
Site Reliability Engineering: How Google Runs Production Systems
Murphy, Niall Richard
Jones, Chris
Beyer, Betsy
Paperback
Comptia A+ Complete Certification Kit: Exam 220-1101 and Exam 220-1102
Docter, Quentin
McMillan, Troy
Buhagiar, Jon
Paperback
RAG-Driven Generative AI: Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone
Rothman, Denis
Paperback
Prometheus: Up & Running: Infrastructure and Application Performance Monitoring
Brazil, Brian
Pivotto, Julien
Paperback
Windows 11 Manual For Seniors: A Beginners Guide to Navigate Your Computer with Step-by-Step Instructions
Wells, Larry
Paperback
Averting the Digital Dark Age: How Archivists, Librarians, and Technologists Built the Web a Memory
Milligan, Ian
Hardcover
Practical Cybersecurity Architecture - Second Edition: A guide to creating and implementing robust designs for cybersecurity architects
Kelley, Diana
Moyle, Ed
Paperback
Building Multi-Tenant Saas Architectures: Principles, Practices, and Patterns Using AWS
Golding, Tod
Paperback
Data and Reality: A Timeless Perspective on Perceiving and Managing Information in Our Imprecise World, 3rd Edition
Kent, William
Paperback
Learning eBPF: Programming the Linux Kernel for Enhanced Observability, Networking, and Security
Rice, Liz
Paperback
Learn Powershell in a Month of Lunches, Fourth Edition: Covers Windows, Linux, and macOS
Petty, James
Leonhardt, Tyler
Plunk, Travis
Paperback
Microsoft 365 and SharePoint Online Cookbook - Second Edition: A complete guide to Microsoft Office 365 apps including SharePoint, Power Platform, Cop
Chamberlain, Nate
Mahajan, Gaurav
Ghatak, Sudeep
Paperback
Mastering Windows Security and Hardening - Second Edition: Secure and protect your Windows environment from cyber threats using zero-trust security pr
Dunkerley, Mark
Tumbarello, Matt
Paperback
Comptia A+ Complete Practice Tests: Core 1 Exam 220-1101 and Core 2 Exam 220-1102
Parker, Jeff T.
O'Shea, Audrey
Paperback
Observability Engineering: Achieving Production Excellence
Fong-Jones, Liz
Miranda, George
Majors, Charity
Paperback
Mastering Office 365 Administration: A complete and comprehensive guide to Office 365 Administration - manage users, domains, licenses, and much more
Carpe, Thomas
Carter, Nikkia
Rogers, Alara
Paperback
Mastering KVM Virtualization - Second Edition: Design expert data center virtualization solutions with the power of Linux KVM
Chirammal, Humble Devassy
Mukhedkar, Prasad
Dakic, Vedran
Paperback
IT Helpdesk Training Best Practices: Desktop Support Troubleshooting and System Administration
Botwright, Rob
Paperback
Free Opensource Office Suite Software Apps For Windows 11 OS Hardcover Ver
Sakura, Cyber Jannah
Hardcover
Multi-Cloud Mastery: Architecting Secure and Scalable Kubernetes Systems and Infrastructures.
Robertson, Adam
Paperback
Learn Powershell Scripting in a Month of Lunches, Second Edition: Write and Organize Scripts and Tools
Jones, Don
Hicks, Jeffery
Petty, James
Paperback
pfSense Essentials: The Complete Reference to the pfSense Internet Gateway and Firewall
Reed, Jeremy C.
Paperback
Cloud Native Data Security with Oauth: A Scalable Zero Trust Architecture
Archer, Gary
Kahrer, Judith
Trojanowski, Michal
Paperback
Linux Kernel Programming - Second Edition: A comprehensive and practical guide to kernel internals, writing modules, and kernel synchronization
Billimoria, Kaiwan N.
Paperback
Cybersecurity Tabletop Exercises: From Planning to Execution
Hollenberger, John
Lelewski, Robert
Paperback
Scaling Cloud Finops: Proven Strategies to Accelerate Financial Success
Zeier, Matthew
Kanumuri, Sasi
Paperback
Cybersecurity Architect's Handbook: An end-to-end guide to implementing and maintaining robust security architecture
Nichols, Lester
Paperback
Industrial Cybersecurity: Efficiently secure critical infrastructure systems
Ackerman, Pascal
Paperback
Version Control with Git: Powerful Tools and Techniques for Collaborative Software Development
Loeliger, Jon
Ponuthorai, Prem Kumar
Paperback
Mastering Embedded Linux Programming - Third Edition: Create fast and reliable embedded solutions with Linux 5.4 and the Yocto Project 3.1 (Dunfell)
Vasquez, Frank
Simmonds, Chris
Paperback
Linux for Beginners: An Introduction to the Linux Operating System and Command Line
Cannon, Jason
Paperback
Container Security: Fundamental Technology Concepts That Protect Containerized Applications
Rice, Liz
Paperback
Python for Devops: Learn Ruthlessly Effective Automation
Gift, Noah
Behrman, Kennedy
Deza, Alfredo
Paperback
Docker: Up & Running: Shipping Reliable Containers in Production
Matthias, Karl
Kane, Sean P.
Paperback
Building Green Software: A Sustainable Approach to Software Development and Operations
Currie, Anne
Hsu, Sarah
Bergman, Sara
Paperback
Mastering Terraform: A practical guide to building and deploying infrastructure on AWS, Azure, and GCP
Tinderholt, Mark
Paperback
The Ultimate Linux Shell Scripting Guide: Automate, Optimize, and Empower tasks with Linux Shell Scripting
Tevault, Donald a.
Paperback
SAP S/4hana Financial Accounting Certification Guide: Application Associate Exam
Pougkas, Stefanos
Paperback
Data Quality Fundamentals: A Practitioner's Guide to Building Trustworthy Data Pipelines
Vorwerck, Molly
Moses, Barr
Gavish, Lior
Paperback
Practical Linux System Administration: A Guide to Installation, Configuration, and Management
Hess, Kenneth
Paperback
Active Directory: Designing, Deploying, and Running Active Directory
Allen, Robbie
Desmond, Brian
Richards, Joe
Paperback
Kubernetes - An Enterprise Guide - Third Edition: Master containerized application deployments, integrate enterprise systems, and achieve scalability
Boorshtein, Marc
Surovich, Scott
Paperback
Wireless Exploits And Countermeasures: Kali Linux Nethunter, Aircrack-NG, Kismet, And Wireshark
Botwright, Rob
Paperback
Mastering Linux Administration - Second Edition: Take your sysadmin skills to the next level by configuring and maintaining Linux systems
Calcatinge, Alexandru
Balog, Julian
Paperback
Effective Devops: Building a Culture of Collaboration, Affinity, and Tooling at Scale
Daniels, Ryn
Davis, Jennifer
Paperback
The Ultimate Docker Container Book - Third Edition: Build, test, ship, and run containers with Docker and Kubernetes
Schenker, Gabriel N.
Paperback
Kubernetes Security and Observability: A Holistic Approach to Securing Containers and Cloud Native Applications
Gupta, Amit
Creane, Brendan
Paperback
DevOps Automation Cookbook: Harness the power of DevOps with 125+ automation recipes (English Edition)
Kumar Singirikonda, Ekambar
Paperback
Business Continuity and Disaster Recovery Planning for IT Professionals
Snedaker, Susan
Rima, Chris
Paperback
Kubernetes Best Practices: Blueprints for Building Successful Applications on Kubernetes
Burns, Brendan
Villalba, Eddie
Strebel, Dave
Paperback
Industrial Cybersecurity - Second Edition: Efficiently monitor the cybersecurity posture of your ICS environment
Ackerman, Pascal
Paperback
Learning Malware Analysis: Explore the concepts, tools, and techniques to analyze and investigate Windows malware
K. a., Monnappa
Paperback
Ansible: Up and Running: Automating Configuration Management and Deployment the Easy Way
Meijer, Bas
Hochstein, Lorin
Moser, René
Paperback
Cisa Certified Information Systems Auditor All-In-One Exam Guide, Fourth Edition
Gregory, Peter H.
Paperback
Cryptography Algorithms - Second Edition: Explore New Algorithms in Zero-knowledge, Homomorphic Encryption, and Quantum Cryptography
Bertaccini, Massimo
Paperback
Cybersecurity Ops with Bash: Attack, Defend, and Analyze from the Command Line
Albing, Carl
Troncone, Paul
Paperback
Building Secure and Reliable Systems: Best Practices for Designing, Implementing, and Maintaining Systems
Beyer, Betsy
Adkins, Heather
Blankinship, Paul
Paperback
Windows Internals: System Architecture, Processes, Threads, Memory Management, and More, Part 1
Yosifovich, Pavel
Russinovich, Mark
Ionescu, Alex
Paperback
A Friendly Guide to Data Science: Everything You Should Know about the Hottest Field in Tech
Vincent, Kelly P.
Paperback
PowerShell for Penetration Testing: Explore the capabilities of PowerShell for pentesters across multiple platforms
Blyth, Andrew
Paperback
Small, Sharp Software Tools: Harness the Combinatoric Power of Command-Line Tools and Utilities
Hogan, Brian
Paperback
Powershell Cookbook: Your Complete Guide to Scripting the Ubiquitous Object-Based Shell
Holmes, Lee
Paperback
NMAP Network Scanning Series: Network Security, Monitoring, And Scanning Library
Botwright, Rob
Paperback
The Ridiculously Simple Guide to Gmail: The Absolute Beginners Guide to Getting Started with Email
La Counte, Scott
Paperback
Hands-On Network Programming with C: Learn socket programming in C and write secure and optimized network code
Van Winkle, Lewis
Paperback
Learning Microsoft Intune: Unified Endpoint Management with Intune & the Microsoft 365 product suite
Duffey, Scott
Paperback
Cisco Certified Devnet Associate Devasc 200-901 Official Cert Guide
Jackson, Chris
Iliesiu, Adrian
Gooley, Jason
Hardcover
The Self-Service Data Roadmap: Democratize Data and Reduce Time to Insight
Uttamchandani, Sandeep
Paperback
Active Directory: Network Management Best Practices For System Administrators
Botwright, Rob
Paperback
Linux: The ultimate guide to Linux for beginners, Linux hacking, Linux command line, Linux operating system, and more!
Newport, Craig
Paperback
LINUX Beginner's Crash Course: Linux for Beginner's Guide to Linux Command Line, Linux System & Linux Commands
Start Guides, Quick
Paperback
Red Hat Enterprise Linux 9 Administration - Second Edition: A comprehensive Linux system administration guide for RHCSA certification exam candidates
Colino, Miguel Pérez
Requena, Pedro Ibáñez
Gómez, Pablo Iranzo
Paperback
Machine Learning Models and Algorithms for Big Data Classification: Thinking with Examples for Effective Learning
Suthaharan, Shan
Hardcover
Learn Linux Quickly: A Comprehensive Guide for Getting Up to Speed on the Linux Command Line (Ubuntu)
Quickly, Code
Bartley, Paul H.
Paperback
Getting Started With Ubuntu OS: A Ridiculously Simple Guide to the Linux Open Source Operating System
La Counte, Scott
Paperback
Istio: Up and Running: Using a Service Mesh to Connect, Secure, Control, and Observe
Calcote, Lee
Butcher, Zack
Paperback
The Kubernetes Bible - Second Edition: The definitive guide to deploying and managing Kubernetes across cloud and on-prem environments
Madapparambath, Gineesh
McKendrick, Russ
Paperback
Security Monitoring with Wazuh: A hands-on guide to effective enterprise security using real-life use cases in Wazuh
Gupta, Rajneesh
Paperback
Architecting for Scale: How to Maintain High Availability and Manage Risk in the Cloud
Atchison, Lee
Paperback
The Enterprise Big Data Lake: Delivering the Promise of Big Data and Data Science
Gorelik, Alex
Paperback
The Myths of Security: What the Computer Security Industry Doesn't Want You to Know
Viega, John
Paperback
Result Page Generation for Web Searching: Emerging Research and Opportunities
Alli, Mostafa
Paperback
The Well-Grounded Java Developer, Second Edition
Evans, Benjamin
Clark, Jason
Verburg, Martijn
Paperback
E-mail: A Write It Well Guide: How to Write and Manage E-mail in the Workplace
Chan, Janis Fisher
Paperback
Mastering Linux Shell Scripting - Second Edition: A practical guide to Linux command-line, Bash scripting, and Shell programming
Ebrahim, Mokhtar
Mallett, Andrew
Paperback
Linux Device Drivers Development: Develop customized drivers for embedded Linux
Madieu, John
Paperback
Online Searching: A Guide to Finding Quality Information Efficiently and Effectively
Markey, Karen
Paperback
Linux Observability with Bpf: Advanced Programming for Performance Analysis and Networking
Calavera, David
Fontana, Lorenzo
Paperback
Making Data Visual: A Practical Guide to Using Visualization for Insight
Fisher, Danyel
Meyer, Miriah
Paperback
Okta Administration Up and Running - Second Edition: Drive operational excellence with IAM solutions for on-premises and cloud apps
Vries, Henkjan de
Stjernlöf, Lovisa Stenbäcken
Paperback
Devops with Openshift: Cloud Deployments Made Easy
Picozzi, Stefano
Hepburn, Mike
O'Connor, Noel
Paperback
Building Virtual Pentesting Labs for Advanced Penetration Testing, Second Edition
Cardwell, Kevin
Paperback
Desktop Support Crash Course: Technical Problem Solving And Network Troubleshooting
Botwright, Rob
Paperback
Essential Linux Commands: 100 Linux commands every system administrator should know
Olushile, Paul
Paperback
Microsoft Security Operations Analyst Exam Ref SC-200 Certification Guide: Manage, monitor, and respond to threats using Microsoft Security Stack for
Anich, Joe
Stuart, Trevor
Paperback