Text Mining with R: A Tidy Approach

Text Mining with R

A Tidy Approach
 
Edition number: 1
Publisher: O?Reilly
Date of Publication:
Number of Volumes: Print PDF
 
Normal price:

Publisher's listprice:
GBP 31.99
Estimated price in HUF:
15 451 HUF (14 715 HUF + 5% VAT)
Why estimated?
 
Your price:

13 906 (13 244 HUF + 5% VAT )
discount is: 10% (approx 1 545 HUF off)
The discount is only available for 'Alert of Favourite Topics' newsletter recipients.
Click here to subscribe.
 
Availability:

Estimated delivery time: In stock at the publisher, but not at Prospero's office. Delivery time approx. 3-5 weeks.
Not in stock at Prospero.
Can't you provide more accurate information?
 
  Piece(s)

 
 
 
 
Product details:

ISBN13:9781491981658
ISBN10:1491981652
Binding:Paperback
No. of pages:194 pages
Size:233x177x15 mm
Weight:272 g
Language:English
0
Category:
Long description:

Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you&&&8217;ll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You&&&8217;ll learn how tidytext and other tidy tools in R can make text analysis easier and more effective.

The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You&&&8217;ll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.

  • Learn how to apply the tidy text format to NLP
  • Use sentiment analysis to mine the emotional content of text
  • Identify a document&&&8217;s most important terms with frequency measurements
  • Explore relationships and connections between words with the ggraph and widyr packages
  • Convert back and forth between R&&&8217;s tidy and non-tidy text formats
  • Use topic modeling to classify document collections into natural groups
  • Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages