• Posted by Elaine Ellis, Marketing
No Comments

Big Boulder is just over a month away, and we’re excited to announce seven incredible new speakers to the Big Boulder agenda. When we started planning the first social data conference, we wanted to put together a world class speaker list. We’ve been thrilled by the response and are excited to add speakers from companies such as Tumblr and Get Satisfaction. We’re also working on some really interesting panels so keep your eye out for more to come!

Below is a list of our latest additions, and you can also see the complete list of speakers.

If you’d like to attend, but aren’t a Gnip customer, we’re looking for volunteers to help with photography and live blogging.

  • Posted by Elaine Ellis, Marketing
No Comments

During natural disasters and health epidemics, social data provides a valuable window into what is happening on the ground. Social data can help track diseases and serve as a powerful method of communication letting people know what is happening. Healthcare is just beginning to understand the value of big data.

Google first gained attention for helping to predict influenza outbreaks based on search volume but scientists from Bristol University took it a step further with Twitter — they analyzed 50 million geo-tagged tweets related to flu and their results nearly perfectly correlated with national health statistics from the CDC.

Social data went a step further when scientists used Twitter to predict cholera outbreaks in Haiti. In 2010, a 7.0 earthquake ravaged the small island of Haiti, a third-world country with meager infrastructure. The island began preparing for disease outbreaks, specifically cholera. During the height of the cholera outbreak, the island was losing people at the rate of 50 people a day and a total of more than 3,000 people were killed that year. Scientists began tracking cholera outbreaks via Twitter and found that social data could beat the official reports by up to two weeks. As one of the scientists noted, “Not all cholera patients go to hospitals to be counted officially.”

This isn’t the first time that social data has played an important role after an earthquake.  During the aftermath of Japan’s earthquake in 2011, social media played an important role in helping to identify how people could obtain medicine for their chronic illnesses that still needed treatment but so many of the traditional pharmaceutical supply chains were cut off.

Like in the Japanese earthquake, there is immense potential for social data to help emergency responders during a disaster. During the Fourmile Canyon Fire in Boulder in 2010, VisionLink provided fire crews and managers a real-time view into what was happening on the ground by layering geo-tagged Tweets and Flickr images they got from Gnip onto a Google map of the area. With this information, emergency workers were better able to see what was happening on the ground.

Studies have proven that people rely on information communicated via social media in emergencies. A study from the University of Western Sydney found that people relied on official news channels and social media and shared and re-tweeted the most useful information. The authors of the study believe that social media helps people to not feel alone as well as to help getting the message out. Which also echoes a study put together by the Red Cross that shows one-third of the U.S. population said they would use social media to let their loved ones know they were safe. Which is exactly why Red Cross has created their own social media control room powered by Radian6.

If you’d like to learn more about social data in natural disasters, contact info@gnip.com.

VisionLink Map of Boulder Fires

  • Posted by Chris Moody, President and COO
5 Comments

I’m thrilled to announce that the full firehose of public Tumblr posts is now available exclusively from Gnip. Tumblr is one of the fastest growing social networks in the world. Much of this growth is fueled by the enormous number of conversations that are unique to the Tumblr community. These conversations cover a huge range of subjects, from movies, TV shows and fashion to business, apparel and consumer products. Check out these stats to get a feel for the volume of discussion on Tumblr:

  • 50 million new posts every day
  • 15 billion page views every month
  • 20 billion total posts
  • 300% traffic growth last year

While some social platforms react quickly to news and other events, Tumblr conversations often spread around concepts and trends. Take the example of Urban Outfitters where a photographer posted a picture to her personal Tumblr of a piece from one of their new collections. That post received over 1,000 notes and almost no mention elsewhere. In the case of Land Rover, the company posted a picture of a dog riding in a Land Rover to their Tumblr that received more than 5,000 notes and very little mention on other networks.

It doesn’t take a large leap to see the impact this type of information can have on brand management and product development. The conversations on Tumblr are rich in images and discussion about brands and products, from simply sharing a picture about a favorite pair of shoes to reblogging news about favorite brand. And given the highly social nature of the Tumblr community, these discussions move quickly and broadly through the community. You often see posts that are shared tens of thousands of times. For brands, every conversation matters and access to the full firehose ensures they won’t miss a thing.

We’re excited to be able to offer Tumblr to our customers and can’t wait to see what other intriguing use cases they find for this data.

Drop us a line at sales@gnip.com to learn more.

  • Posted by Jud Valeski, Co-Founder and CEO
1 Comment

Gnip moves billions of real-time public social activities to its customers everyday. Doing that efficiently, accurately, and reliably is an awesome challenge. Gnip also has a lot of product roadmap to build-out. We’re hiring across the board, but I wanted to provide some insight into the software side of things to give any software construction minded readers of this post a sense of the interesting technical challenges you’d get to work on if you were on our development team.

Big Streams: Long-lived, Stateful, Variable Throughput TCP Connections
While our web-app (Rails) looks/feels much like a typical web-app, looks can be deceiving. Our core system (Java) has uncommon TCP (HTTP for the most part) connection challenges that we’re constantly applying operational and business logic creativity to. We often describe the connectivity scenario as akin to a video streaming system (Netflix for example). The load balancing, tuning, connection handling logic (often in code (Java and/or C)), restart, buffering, flushing games we get to play keep one’s brain thoroughly engaged. Imagine getting to write code that runs in a pipeline moving at several hundred Mbps (sustained).

Big CPU: Filtering
We’ve built a real-time equivalent parallel to age-old SQL. Something powerful and efficient, yet simple and intuitive to use. The world has grown up with SQL on their minds, yet it generally doesn’t apply to the world of high-volume real-time nature of public social data streams. We spend a lot of time and energy crafting the language, as well as building the infrastructure underneath to ensure it can operate on a message (from small Tweets, to large blog posts) in ones of milliseconds. Very powerful. Very fun. We’d love your help in evolving this part of the system with us.

Big Blending: Enrichments
As data moves through Gnip’s infrastructure we do a variety of things to it. As a couple examples, we will enrich it with Klout scores and typed language classification scores. We’ll also unwind all those opaque URLs (and allow you to filter against the result) for our customers, so they don’t have to stand up horizontally scaled/parallelized infrastructure to do so on their end. We have a long list of enrichments we’re in the process of adding, and we’d like to do so with as minimal latency impact as technically possible. Speed matters. There’s amazing opportunity in the industry to blend other datasets into “the stream.” Help us do this.

Big Data: Historical
There are three organizations on Earth that have the complete public Twitter corpus on their servers. Gnip, Twitter & the Library of Congress. Gnip has dipped our toes into the “historical” offering with our 30-day reply product. Doing so has allowed us to meet many of the business critical backfill requirements of our customers. That said, there is still a huge opportunity in the historical space. We leverage a variety of parallelizable data access, query, and filter technologies at Gnip, but want more horsepower here. We’d love to have more people on the team with practical experience around map-reduce based data access models. Gnip personifies “Big Data” challenges and solutions. Show us what you’ve got (beyond textbook and academic understanding of the latest trends in this space)!

Big Customers: Impact
We’ve been delivering public social data for four years and serve the biggest, most demanding customers in the space. From 8 of the 9 largest social media monitoring companies to some of the largest hedge funds in the world, Gnip is expected to provide a bulletproof solution. Through our customers, Gnip serves more than 90% of the Fortune 500. We’re not just about “Big Ideas,” we’re about “Big Impact.” Want to work in a place where what you do matters? This is it.

Our Approach
We take an “every message matters” approach to our products. As a result, our development team is tightly coupled (one-and-the-same for the most part) with the production operation of the system. We don’t have a wall between the software being built and that software being run. Gnippers who write code also operate that code throughout the local, review, staging, and production environments. We believe this approach yields a better service for our customers; higher quality, better reliability, better consciousness. There is a strong sense of collective code ownership, as opposed to folks siloing into “their area.” This results in everyone’s creative and talented mind having an opportunity to impact the entire system, and subsequently yields better, more consistent, product.

jobs@gnip.com

  • Posted by Chris Moody, President and COO
No Comments

If you’re a large retail chain and people are talking about your brand, you want to know that immediately. But what if your brand is being talked about in Chinese on the largest microblogging service in China with 300 million members?

For large Western brands with a presence in China, understanding what is happening on Sina Weibo is just as important as understanding any social media channel in English speaking locales. It’s a legitimate problem as evidenced by KFC recently squelching rumors about its delivery service or the Red Cross having to address misuse of funds by a supposed employee on Sina Weibo. Ogilvy even did a study on the importance of responding to crises due to how quickly conversations spread on Weibos, as it’s extremely viral. For KFC, one Weibo post was forwarded 41,000 times and picked up 5,000 comments.

Monitoring plays an important role in crisis communications and brand awareness, and Gnip is proud to announce that we are now offering Sina Weibo as part of our Enterprise Data Collector offering. This makes it seamless for Gnip’s social media clients to begin offering public Sina Weibo activities to their customers.

With 500 million Internet users, China has the largest population of Internet users on the planet and there are 300 million people using Sina Weibo that are potential customers for Western brands having conversations that shouldn’t be missed. The user base of Sina Weibo is extremely diverse and includes celebrities, executives, politicians and advertisers. According to a joint study with Sina Weibo and social media intelligence firm CIC estimate that some 50,000 companies have signed up on Sina Weibo to date. Why? Because 95% of surveyed Chinese believe that brands with a presence on Sina Weibo are more trustworthy.

More Facts About Sina Weibo:

  • Brands on Sina Weibo include Ikea, L’Oreal, Louis Vuitton, Dell, Coca Cola, Nike, Adidas, Burberry, Starbucks, Remy Martin and ZZ.
  • The most Weibos ever sent was an astounding 32,312 messages per second were sent during the Chinese New Year period.
  • Sixty-one percent of social networking users in China have made a purchase because of a digital campaign.
  • Sina Weibo has 57% of the Weibo market based on active users and 87% based on browsing time.
  • The most popular Weibo personality, Chinese actress Chen Yao has more than 17 million followers.
  • Notable personalities using Sina Weibo include Emma Watson, Tom Cruise, Bill Gates, Paris Hilton, San Francisco Mayor Ed Lee and Radiohead.
  • Nearly half a million of Sina Weibo users are from the United States.

If you’re interested in looking at understanding and analyzing the public Chinese social media market, please email sales@gnip.com for more information.

Ikea on Sina Weibo

  • Posted by Chris Moody, President and COO
No Comments

Social data is an emerging new industry and enterprises are applying social data to solve challenges in both the business world and the public sector. And while there are a wide range of conferences to discuss big data or social media marketing, there hasn’t been a place for the people and organizations working with social data to discuss best practices, challenges, trends and developments.

We decided to change that. Gnip is excited to launch Big Boulder, the first conference dedicated to social data and the myriad use cases surrounding it.  Big Boulder will take place on June 21-22 in beautiful Boulder, Colorado.

Our speaker lineup is incredible and we’re proud of the caliber of talent we’re attracting to our inaugural conference. Confirmed speakers include Dave Morin of Path, Ryan Sarver and Doug Williams of Twitter, Sean Bruich of Facebook, Daniel Ha and Ro Gupta of Disqus, Howard Lindzon of StockTwits, Joe Fernandez and Matt Thomson of Klout, Zach Hofer-Shall of Forrester and Bryan Hyndman of NASDAQ. And this is just the start.

In fact, I’m excited to announce two new additions to our speaking roster, Scott Nicholson, data scientist at LinkedIn, and Wendy Lea, CEO of Get Satisfaction. We’re thrilled to add them to the agenda and look forward to announcing more new speakers in the coming weeks.

Social data is in early stages but it’s already being used to power a range of incredible use cases including predicting stock markets, breaking the news of earthquakes, and helping emergency responders in Japan find medicine after last year’s earthquake. Big Boulder will have both fireside chats with the leading social publishers and panels discussing the use of social data in the public sector and financial markets, geo-spatial applications of social media data and social data analytics.

Being located in Boulder has been amazing for Gnip, and we’re thrilled to show off our hometown. Boulder has been frequently recognized for our thriving high tech community. Not to mention, Boulder has been named the foodiest, healthiest, happiest and the best town for startups.

Big Boulder will be a deliberately intimate conversation, so attendance will be by invitation only for Gnip’s partners and customers. Check out www.bigboulderconf.com to learn more.

  • Posted by Adam Tornes, Product
No Comments

Our customers tell us that getting every single Tweet that matters is one of the key reasons they work with Gnip. And sometimes getting every Tweet that matters means filtering out the Tweets you don’t want. With this in mind, I’m happy to announce the introduction of two new operators to our Power Track filtering suite.

Retweet Operator

The Retweet operator allows a customer to ensure only Retweets that match a rule are delivered or excluded.

To use the Retweet operator, simply add is:retweet or –is:retweet to any rule.

Examples Include:

  • Receive only Retweets mentioning Apple using a rule like: apple is:retweet as a way to measure engagement of the brand’s fan base

or

  • Get only Tweets with unique content about Apple using a rule like: apple -is:retweet to monitor conversation about the brand and ignore the tremendous volume of retweets generated by the brand

Sampling Operator

The Sampling operator allows a customer to receive a random sample of Tweets that match a rule rather than the entire set of Tweets.

There are several use cases where the Sampling operator is useful.  Say you want to stay within a budgeted number of Tweets each month, but you’re trending higher than that budget halfway through the month.  With the Sampling operator, you can scale back your consumption without fully eliminating rules.  In another use case you might want to monitor a very high-volume rule or user, but your internal systems can’t handle this volume.  Sampling makes this more manageable.  Finally, there are times when you simply need to know the directional volumes for things, and don’t need every tweet.

To use the Sampling operator, add sample:## to any rule with an integer value between 1 to 100. The Sampling operator applies to the entire rule and requires any “OR’d” terms be grouped.

Examples Include:

  • Receive a sampling of 10% of all Tweets that contain “apple” using a rule like:

apple sample:10

or

  • Receive a sampling of 50% of all Tweets that contain “iPad” or “iPhone” using a rule like:

(ipad OR iphone) sample:50

As always, thank you for the product feedback and keep it coming.  Additional documentation of these new operators and others can be found in our online documentation.

 

Back From Strata

March 7th, 2012
  • Posted by Elaine Ellis, Marketing
1 Comment

If you didn’t have a chance to attend the recent Strata conference, our COO and president, Chris Moody, did an interview with Alex Howard of O’Reilly.

Joe Brockmeier of ReadWriteWeb had an excellent article recapping Chris Moody’s presentation on the social media cocktail. The ProgrammableWeb also talks about how Gnip was first to market with Disqus data.

  • Posted by Chris Moody, President and COO
1 Comment

Imagine going to a dinner party and listening to the first thing each person said. You’d learn a few things, but you’d miss out on the meat of the conversation that happens in the give and take of the dialogue.

In the world of online public social conversation, blog posts are the monologue and comments provide the dialogue. Each is valuable on their own, but to see the complete picture, you need both. Conversations happen in comments, and it has been a huge struggle for brands to be able to keep up with comments to fill in their understanding of this key piece of the conversation.

I’m excited to announce that we’re making it easier to access these public conversations with the addition of Disqus to our publisher portfolio. As the largest third-party commenting platform in the world with 70 million commenter profiles, Disqus provides coverage of more than 500,000 comments every day, spanning almost every topic imaginable and reaching over 700 million readers each month.

Comments last forever. They appear in search results and remain part of the discussion long after the day they were written.  With their staying power and depth of discussion, the commenting ecosystem provides an important — and different — social signal. Disqus further embodies this by allowing users to react to others’ comments with up or down “votes” creating significantly more engagement. The 2 million “votes” on Disqus each day provide insight into what comments are generating the most reaction.

Our API partnership provides authorized access for the first time ever to full firehoses of discussion content and interaction across the Disqus network. To the extent that any of this data has been available before, it’s been provided by technologies like content scraping/crawling that pulled pieces of the discussion, but did not guarantee full coverage in real time on a publisher-safe, consistent and reliable basis. Because this new service is being provided via a direct partnership with Disqus, with Gnip’s full firehose, you get low-latency streams that provide full coverage with the support of the publisher to ensure the availability of the data over the long term.

We’re thrilled to have data from Disqus available on our platform and can’t wait to see the amazing ways that our customers are able to apply it to their businesses. Email us at sales@gnip.com to learn more about Disqus and set up a trial so you can see the data for yourself.

Follow Gnip

Search

Archive

Recent Posts
Categories
Tags
Blogroll

Recent Tweets

Switch to our mobile site