<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Gnip Blog</title>
	<atom:link href="http://blog.gnip.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.gnip.com</link>
	<description>Social media data tracking, updates from Twitter, Facebook, and other publishers, Gnip product updates, and more.</description>
	<lastBuildDate>Thu, 02 Feb 2012 16:48:14 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.2</generator>
		<item>
		<title>Enhanced Filtering for PowerTrack</title>
		<link>http://blog.gnip.com/enhanced-filtering-for-powertrack/</link>
		<comments>http://blog.gnip.com/enhanced-filtering-for-powertrack/#comments</comments>
		<pubDate>Thu, 02 Feb 2012 16:48:14 +0000</pubDate>
		<dc:creator>Adam Tornes, Product</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Product]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[data filtering]]></category>
		<category><![CDATA[firehose]]></category>
		<category><![CDATA[gnip]]></category>
		<category><![CDATA[language filters]]></category>
		<category><![CDATA[power track]]></category>
		<category><![CDATA[tweets]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2607</guid>
		<description><![CDATA[Gnip is always looking for ways to improve its filtering capabilities and customer feedback plays a huge role in these efforts.  We are excited to announce enhancements to our PowerTrack product that allow for more precise filtering of the Twitter Firehose, a feature enhancement request that came directly from you, our customers. Gnip PowerTrack rules [...]]]></description>
			<content:encoded><![CDATA[<div>
<p>Gnip is always looking for ways to improve its filtering capabilities and customer feedback plays a huge role in these efforts.  We are excited to announce enhancements to our PowerTrack product that allow for more precise filtering of the Twitter Firehose, a feature enhancement request that came directly from you, our customers.</p>
<p>Gnip PowerTrack rules now support OR and Grouping using ().  We have also loosened limitations on the number of characters and the number of clauses per rule. Specifically, a single rule can now include up to 10 positive clauses and up to 50 negative clauses (previously 10 total clauses).  Additionally, the character limit per rule has grown from 255 characters to 1024.</p>
<p>With these changes, we are now able to offer our customers a much more robust and precise filtering language to ensure you receive the Tweets that matter most to you and your business.  However, these improvements bring their own set of specific constraints that are important to be aware of.  Examples and details on these limitations are as follows:</p>
<p><strong>OR and Grouping Examples</strong></p>
<ul>
<li>apple OR microsoft</li>
<li>apple (iphone OR ipad)</li>
<li>apple computer –(fruit OR green)</li>
<li>(apple OR mac) (computer OR monitor) new –fruit</li>
<li>(apple OR android) (ipad OR tablet) –(fruit green microsoft)</li>
</ul>
<p><strong>Character Limitations</strong></p>
<ul>
<li>A single rule may contain up to 1024 characters including operators and spaces.</li>
</ul>
<p><strong>Limitations</strong></p>
<ul>
<li>A single rule must contain at least 1 positive clause</li>
<li>A single rule supports a max of 10 positive clauses throughout the rule</li>
<li>A single rule supports max of 50 negative clauses throughout the rule</li>
<li>Negated ORs are not allowed. The following are examples of invalid rules:</li>
<li>-iphone OR ipad</li>
<li>ipad OR -(iphone OR ipod)</li>
</ul>
<p><strong>Precedence</strong></p>
<ul>
<li>An implied &#8220;AND&#8221; takes precedence in rule evaluation over an OR</li>
</ul>
<p>For example a rule of:</p>
<ul>
<li>android OR iphone ipad  would be evaluated as apple OR (iphone ipad)</li>
<li>ipad iphone OR android would be evaluated as (iphone ipad) OR android</li>
</ul>
<p>You can find full details of the Gnip Power Track filtering changes in our<a href="http://docs.gnip.com/w/page/35663947/Power%20Track"> online documentation</a>.</p>
<p>Know of another way we can improve our filtering to meet your needs?  Let us know in the comments below.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/enhanced-filtering-for-powertrack/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Gnip and Automattic Make Whole New Universe of Data Available</title>
		<link>http://blog.gnip.com/gnip-and-automattic-make-whole-new-universe-of-data-available/</link>
		<comments>http://blog.gnip.com/gnip-and-automattic-make-whole-new-universe-of-data-available/#comments</comments>
		<pubDate>Tue, 17 Jan 2012 16:42:14 +0000</pubDate>
		<dc:creator>Bill Adkins, Director of Business Development</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Partners]]></category>
		<category><![CDATA[Product]]></category>
		<category><![CDATA[automattic]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[comments]]></category>
		<category><![CDATA[deep analysis]]></category>
		<category><![CDATA[engagement]]></category>
		<category><![CDATA[firehose]]></category>
		<category><![CDATA[gnip]]></category>
		<category><![CDATA[intensedebate]]></category>
		<category><![CDATA[jetpack]]></category>
		<category><![CDATA[likes]]></category>
		<category><![CDATA[wordpress.com]]></category>
		<category><![CDATA[wordpress.org]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2590</guid>
		<description><![CDATA[&#8220;This new data from Automattic is a big addition and a testament to Gnip&#8217;s commitment to drive the social data economy forward. This is an important source to add to the social data mix, one that we know our customers will take full advantage of.&#8221; - Rob Begg, VP Marketing of Radian6 As social media [...]]]></description>
			<content:encoded><![CDATA[<h4 style="text-align: left;">&#8220;This new data from Automattic is a big addition and a testament to Gnip&#8217;s commitment to drive the social data economy forward. This is an important source to add to the social data mix, one that we know our customers will take full advantage of.&#8221;</h4>
<p style="padding-left: 300px;"><strong>- Rob Begg, VP Marketing of Radian6</strong></p>
<div>As social media data becomes more and more important across a range of businesses, our customers are asking for access to more data sources to give them a more complete picture of the social media conversations that are relevant to their businesses.</p>
<p>Today, we’re excited to <a href="http://gnip.com/pr_gnip_expands_relationship_automattic" target="_blank">announce a major addition</a> to our coverage of the conversations taking place on blogs around the world. We’re expanding our relationship with <a href="http://automattic.com/" target="_blank">Automattic</a> to make a whole new universe of blog and comment data available to the market for the first time anywhere.</p>
<p>For those who don’t know, Automattic is a network of web services including <a href="http://wordpress.com/#!/fresh/" target="_blank">WordPress.com</a>, <a href="http://vip.wordpress.com/" target="_blank">VIP hosting and support</a>, <a href="http://polldaddy.com/" target="_blank">Polldaddy</a>, <a href="http://intensedebate.com/" target="_blank">IntenseDebate</a>, and <a href="http://jetpack.me/" target="_blank">Jetpack</a>. We’ve been delivering data from WordPress.com and IntenseDebate for about a year and a half and found that while our customers loved their data, they always wanted more.</p>
<p>As of today, we are now offering the full firehose of blog posts and comments from <a href="http://jetpack.me/" target="_blank">Jetpack-powered</a> WordPress.org sites, as well as engagement streams of “likes” from WordPress.com and IntenseDebate. The new data from WordPress.org greatly increases the coverage available to those who are looking to do deep analysis of blog posts and comments. The new engagement streams enable companies to pull in reaction data to quickly understand sentiment, relevance and resonance. With this they can gauge the intensity of opinion around fast moving blog and comment conversations, helping prioritize critical response.</p>
<p>Being full firehoses, all of the streams from Automattic ensure 100% coverage in realtime giving customers the peace of mind that they can keep up the entire discussion on fast moving threads.</p>
<p>The scope of coverage offered by Automattic is pretty incredible.  Check out some of these stats:</p>
<ul>
<li><a href="http://wordpress.org/news/2011/08/state-of-the-word/" target="_blank">14.7% of the top million websites</a> in the world are powered by WordPress</li>
<li><a href="http://wordpress.org/news/2011/08/state-of-the-word/" target="_blank">22 out of every 100 new active domains in the US</a> are running WordPress</li>
<li>65% of visits to WordPress powered sites are from outside the US</li>
</ul>
<p>We’re thrilled to be able to offer these new data streams to our customers and can’t wait to see the amazing things they’ll be able to do with them.</p>
</div>
<p><strong>Updated:</strong> Coverage in GigaOM &#8211; <a href="http://gigaom.com/2012/01/17/gnip-wordpress-partnership-expansion/" target="_blank">Gnip and WordPress deepen ties, expand data partnership</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/gnip-and-automattic-make-whole-new-universe-of-data-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Twitter Shouts: Huntsman&#8217;s Out!</title>
		<link>http://blog.gnip.com/twitter-shouts-huntsmans-out/</link>
		<comments>http://blog.gnip.com/twitter-shouts-huntsmans-out/#comments</comments>
		<pubDate>Mon, 16 Jan 2012 23:34:10 +0000</pubDate>
		<dc:creator>Seth McGuire, Director of Asset Management &#38; Financial Technology</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[data streams]]></category>
		<category><![CDATA[firehose]]></category>
		<category><![CDATA[gnip]]></category>
		<category><![CDATA[gop presidential race]]></category>
		<category><![CDATA[jon huntsman]]></category>
		<category><![CDATA[newsgator]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[social media analytics]]></category>
		<category><![CDATA[tweets]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2561</guid>
		<description><![CDATA[At Gnip, one of the most fascinating aspects of social media is ‘speed’ &#8211; specifically in regards to news stories. We continue to see a trend towards the ‘breaking’ of news stories on platforms like Twitter. Both the speed at which a story is broken as well as the speed at which that story catches [...]]]></description>
			<content:encoded><![CDATA[<p>At  Gnip, one of the most fascinating aspects of social media is ‘speed’ &#8211;  specifically in regards to news stories. We continue to see a trend towards the ‘breaking’ of news stories on platforms like Twitter. Both the speed at which a story is broken as well as the speed at which that story catches on show the incredible power of this medium for information exchange. And as we’ve pointed out before, different social  media streams offer different analytical value &#8211; Twitter versus a news feed for example.</p>
<p>Last night proved a great example of this as word of Huntsman’s withdrawal from the GOP presidential race crept out. Interestingly, the news was broken by Peter Hamby, a CNN Political Reporter&#8211;on Twitter.  While CNN followed up on this news a few minutes later, it seems the  reporter (or the network) realized the inherent ‘newswire’ value of  breaking this news as fast as possible&#8230;and used Twitter as part of  their strategy to do so!</p>
<p><a href="https://twitter.com/#!/PeterHambyCNN/status/158733697833639936"><img class="aligncenter" src="https://lh3.googleusercontent.com/0TGdT13Zx7mPUA_oJBjDC6VSNlU9VSwWMrtZrN26Je124Lx33xR-TEknqvU05GZfD93n0Z443CF4Zva9AaOfE3pRjIojZLPX0iWRoLxI6szLF7urvQI" alt="" width="534px;" /></a></p>
<p>This Tweet was followed with what we’ve begun to see as the normal ‘Twitter’ spike for breaking news &#8211; the chart below, built by our Data Scientist Scott, shows how quickly Huntsman withdrawl was retweeted and passed along. When looked at in comparison to an  aggregate news feed (in this case, NewsGator’s Datawire Firehose, which  is a content aggregator derived from crowdsourced rss feeds and contains many articles from traditional media providers), some  interesting comparisons are brought to light.<br />
<a href="http://blog.gnip.com/wp-content/uploads/2012/01/pubfits.png"><img class="aligncenter size-full wp-image-2568" title="pubfits" src="http://blog.gnip.com/wp-content/uploads/2012/01/pubfits1.png" alt="Comparing the pulse of Twitter and NewsGator articles breaking Huntsman's withdrawal from the GOP primary race." width="534px;" /></a><br />
Comparing tweets of &#8220;huntsman&#8221; and news articles breaking Jon Huntsman&#8217;s withdrawal from GOP primary race. The blue curves show the &#8220;Social Activity Pulse&#8221; that characterizes the growth and decay of media activity around this topic.  By fitting the rate of articles or tweets to a function we can compare standard measure such as time-to-peak, store half-life etc.  (More on this in a future post.)  The peak in Twitter is reached about the same time as the first story arrives from NewsGator, over 10 minutes after the story broke on Twitter.</p>
<p>Both  streams show a similar curve in story adoption, peak and tail. What’s  different is the timeframe of the content. Twitter’s data spikes about  10 minutes earlier than NewsGator&#8217;s. NewsGator&#8217;s content is more  in-depth, as it contains news stories and blog posts, but as we&#8217;ve seen in other cases, Twitter is the place where news breaks these days.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/twitter-shouts-huntsmans-out/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tebow Time Scores in Social Media</title>
		<link>http://blog.gnip.com/tebow-time-scores-in-social-media/</link>
		<comments>http://blog.gnip.com/tebow-time-scores-in-social-media/#comments</comments>
		<pubDate>Mon, 09 Jan 2012 04:18:01 +0000</pubDate>
		<dc:creator>Randy Almond, Marketing</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Just For Fun]]></category>
		<category><![CDATA[broncos]]></category>
		<category><![CDATA[football]]></category>
		<category><![CDATA[nfl]]></category>
		<category><![CDATA[social media data]]></category>
		<category><![CDATA[steelers]]></category>
		<category><![CDATA[tim tebow]]></category>
		<category><![CDATA[traffic spike]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2552</guid>
		<description><![CDATA[Wow, what a game!  If you missed the instant classic that was the Broncos/Steelers overtime game tonight, check out the recap. When Tim Tebow connected with Demaryius Thomas on an 80-yard touchdown pass on the first play of overtime, we saw a noticeable spike in the overall volume of social media messages flowing through the [...]]]></description>
			<content:encoded><![CDATA[<p>Wow, what a game!  If you missed the instant classic that was the Broncos/Steelers overtime game tonight, check out <a href="http://scores.espn.go.com/nfl/recap?gameId=320108007" target="_blank">the recap</a>.</p>
<p>When Tim Tebow connected with Demaryius Thomas on an 80-yard touchdown pass on the first play of overtime, we saw a noticeable spike in the overall volume of social media messages flowing through the Gnip platform.</p>
<p>Tebow Time!</p>
<p><a href="http://blog.gnip.com/wp-content/uploads/2012/01/Tebow-Time.png"><img class="alignnone size-full wp-image-2553" title="Tebow-Time" src="http://blog.gnip.com/wp-content/uploads/2012/01/Tebow-Time.png" alt="Spike in Social Media Mentions when Tim Tebow Throws Winning Touchdown against the Steelers" width="414" height="334" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/tebow-time-scores-in-social-media/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Are Facebook Users More Optimistic than Twitter Users?</title>
		<link>http://blog.gnip.com/are-facebook-users-more-optimistic-than-twitter-users/</link>
		<comments>http://blog.gnip.com/are-facebook-users-more-optimistic-than-twitter-users/#comments</comments>
		<pubDate>Thu, 05 Jan 2012 18:56:26 +0000</pubDate>
		<dc:creator>Scott Hendrickson, Data Science</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[realtime data]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[tweets]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2546</guid>
		<description><![CDATA[New Year&#8217;s Eve gives us a sense of closure on the past and an opportunity to make new dreams. With the emergence of social media, we can now see these reflections and resolutions transpire in realtime. As we observed the posts, comments, and tweets related to the New Year, we saw the typical expressions on [...]]]></description>
			<content:encoded><![CDATA[<p>New Year&#8217;s Eve gives us a sense of closure on the past and an opportunity to make new dreams. With the emergence of social media, we can now see these reflections and resolutions transpire in realtime. As we observed the posts, comments, and tweets related to the New Year, we saw the typical expressions on Facebook and Twitter of best wishes for the coming year and pithy observations about the past year. What we didn’t expect was that users of the two popular social media sites would have different outlooks on the world.</p>
<p>As we enter 2012, Facebook users are more optimistic than Twitter users.</p>
<p>You’re probably wondering how we can say that. Well, we looked at all of the public posts on Facebook and Tweets on Twitter that contained “Happy New Year.” For all of those posts and Tweets, we compared the use of positive words such as &#8220;better&#8221; and &#8220;good&#8221; to the use of negative words such as &#8220;worse&#8221; and &#8220;bad.&#8221; We found that Tweets with positive words appeared 8 times more frequently than Tweets with negative words. You might be thinking a ratio of 8 to 1 is pretty optimistic&#8230;</p>
<p>It may be, but posts on Facebook had a ratio of 40 to 1&#8211;such a huge difference lead us to speculate that Facebook is a more optimistic place than Twitter.</p>
<p>Interesting stuff. Could be a variety of reasons for the difference, from the mix of users on each service to the fact that Facebook is used to communicate with friends, while Twitter is user to broadcast to followers. We’ll leave the speculation up to you.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/are-facebook-users-more-optimistic-than-twitter-users/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Social Media Knows As Much About The Holidays As Santa Does</title>
		<link>http://blog.gnip.com/social-media-knows-as-much-about-the-holidays-as-santa-does/</link>
		<comments>http://blog.gnip.com/social-media-knows-as-much-about-the-holidays-as-santa-does/#comments</comments>
		<pubDate>Mon, 19 Dec 2011 19:43:45 +0000</pubDate>
		<dc:creator>Seth McGuire, Director of Asset Management &#38; Financial Technology</dc:creator>
				<category><![CDATA[Industry]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[black friday]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[holiday]]></category>
		<category><![CDATA[inventory management]]></category>
		<category><![CDATA[IR]]></category>
		<category><![CDATA[mashable]]></category>
		<category><![CDATA[mr. youth]]></category>
		<category><![CDATA[operational planning]]></category>
		<category><![CDATA[product feedback]]></category>
		<category><![CDATA[revenue forecasting]]></category>
		<category><![CDATA[sales and marketing planning]]></category>
		<category><![CDATA[supply chain]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[wordpress]]></category>
		<category><![CDATA[wsj]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2530</guid>
		<description><![CDATA[The holidays are an exciting time at Gnip&#8230;and not just because our CEO loves bringing random bottles of excellent Scotch to the office. Around this time of year we get some visibility into the incredible ways our retail and consumer product clients are using social data. In fact, Mashable recently highlighted a study by Mr. [...]]]></description>
			<content:encoded><![CDATA[<p dir="ltr">The  holidays are an exciting time at Gnip&#8230;and not just because our CEO  loves bringing random bottles of excellent Scotch to the office. Around  this time of year we get some visibility into the incredible ways our  retail and consumer product clients are using social data. In fact,  Mashable <a href="http://mashable.com/2011/12/14/social-media-holiday-purchases/">recently highlighted</a> a study by <a href="http://www.mryouth.com/">Mr. Youth</a> (a marketing firm) with an incredible stat that helps prove how valuable this data truly is:</p>
<p style="padding-left: 30px" dir="ltr"><em>“66%  of respondents who bought something on Black Friday did so as a direct  result of social media interactions with friends and family.”</em></p>
<p>While  that stat speaks to the impact social media has upon us as individuals,  think more broadly about how powerful it is to analyze that data in  aggregate, in real-time. Companies are leveraging data from WordPress  blogs, Twitter mentions, Facebook likes and multiple other sources to  inform critical realtime decisions for inventory management and  operational planning, sales and marketing planning, revenue forecasting,  and many others.</p>
<p><span style="text-decoration: underline">Example Scenario for Using Social Data:</span> It’s holiday time, 2011. Your company begins to aggregate ‘mentions’ of  a new product from Twitter, Facebook, WordPress blogs in realtime. You  take that data and analyze it for # of mentions about the new product,  geography of posts (where available), demographic information within  user profiles (what keywords are most consistent within Twitter user  profiles that mentioned your product?), etc.</p>
<p>You  spread that data among multiple divisions, providing additional  forecast, regional buying pattern, and customer habit data. Your teams  use that to:</p>
<ol>
<li><strong>Manage supply chain:</strong> Redirect inventory to areas with highest potential sales and (depending  on how far out you are) use as a data point in the S&amp;OP system for  manufacturing forecasts to keep ahead of the holiday demand.</li>
<li><strong>Target marketing spend:</strong> Use regional buying patterns and customer habit data to inform what  demographic you are, and aren’t, hitting. Do you need to reposition your  marketing plan?</li>
<li><strong>Incorporate product feedback:</strong> Are there consistent reasons why people are buying your product &#8211; or  why they aren’t? Information on quality, packaging, price, etc will be  incredibly valuable for future products.</li>
<li><strong>Calibrate investor expectations:</strong> Inform your IR team of potential positive/negative performance feedback to give them running room ahead of any announcements.</li>
</ol>
<p>Those  are just some of the more common use cases we’re seeing. But new  opportunities are popping up on a daily basis. We spotted this gem in a  recent WSJ article <a href="http://online.wsj.com/article/SB10001424052970204026804577098451316357124.html?mod=WSJ_hp_MIDDLENexttoWhatsNewsTop">about finding a parking space</a> during crazy shopping times:</p>
<p style="padding-left: 30px" dir="ltr"><em>Bud  Kleppe, a real-estate agent in St. Paul, Minn., watches Mall of  America&#8217;s Twitter feed for parking updates. (The mall sends them out  under the hash tag #moaparking.)</em></p>
<p>Imagine  collecting data from update systems like this and using it measure  parking turnover across prime shopping days. Now, overlay the turnover  of spots in specific sections against a map of stores and you have some  interesting potential for data on economic performance and forecasting.  When incorporated with other traditional retail data and compared on a  store-to-store basis, you’ve built a unique and realtime analysis tool.</p>
<p>You’re  only limited by your imagination in how you can apply social media data  to you business. The more software developers, corporations, and people  use social media, and the more things they use it for (like parking  updates!), the greater the possible use cases for analysis of that data  and the more valuable it becomes.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/social-media-knows-as-much-about-the-holidays-as-santa-does/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Gnip Cagefight #2: Pumpkin Pie vs. Pecan Pie</title>
		<link>http://blog.gnip.com/gnip-cagefight-2-pumpkin-pie-vs-pecan-pie/</link>
		<comments>http://blog.gnip.com/gnip-cagefight-2-pumpkin-pie-vs-pecan-pie/#comments</comments>
		<pubDate>Tue, 22 Nov 2011 23:21:20 +0000</pubDate>
		<dc:creator>Scott Hendrickson, Data Science</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Just For Fun]]></category>
		<category><![CDATA[cagematch]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[newsgator]]></category>
		<category><![CDATA[pecan pie]]></category>
		<category><![CDATA[pie]]></category>
		<category><![CDATA[pumpkin pie]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[twitter data]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2460</guid>
		<description><![CDATA[Thanksgiving is a time for family gatherings, turkey with all the delicious fixings, football, and let’s not forget, pie! If your family is anything like mine, multiple pie flavors are required to satisfy the differing palates and strong opinions. So we wondered, which pies are people discussing for the holiday? What better way to celebrate [...]]]></description>
			<content:encoded><![CDATA[<p>Thanksgiving is a time for family gatherings, turkey with all the delicious fixings, football, and let’s not forget, pie! If your family is anything like mine, multiple pie flavors are required to satisfy the differing palates and strong opinions. So we wondered, which pies are people discussing for the holiday? What better way to celebrate and answer that question than with a Gnip Cagefight. </p>
<h2>Welcome to the Battle of the Pies!</h2>
<p>For those of you that have been in a pie eating contest or had a pie in the face, you know this one will be a fight all the way down to the very last crumb. In one  corner (well actually it is the Gnip Octagon so can you really have corners, oh well) we have The Traditionalist, pumpkin pie and in the opposite corner, The New Comer, pecan pie. Without further ado, Ladies and Gentleman, Let’s Get Ready to Rumble, wait wrong sport. Let’s Fight!  </p>
<h2>Six Social Media Sources, Two Words, One Winner . . . And the Winner Is . . .</h2>
<p></p>
<table align="center" style="border: 1px solid black;border-collapse:collapse">
<tr>
<th>&nbsp;Source&nbsp;</th>
<th>&nbsp;Pumpkin Pie&nbsp;</th>
<th>&nbsp;Pecan Pie&nbsp;</th>
<th>&nbsp;Winning Ratio&nbsp;<br />&nbsp;Pumpkin Pie to Pecan Pie&nbsp;</th>
</tr>
<tr>
<td style="border: 1px solid black">Twitter</td>
<td align="center" style="border: 1px solid black">X</td>
<td style="border: 1px solid black"></td>
<td align="center" style="border: 1px solid black">4:1</td>
</tr>
<tr>
<td style="border: 1px solid black">Facebook</td>
<td align="center" style="border: 1px solid black">X</td>
<td style="border: 1px solid black"></td>
<td align="center" style="border: 1px solid black">5:1<br />
</tr>
<tr>
<td style="border: 1px solid black">Google+</td>
<td align="center" style="border: 1px solid black">X</td>
<td style="border: 1px solid black"></td>
<td align="center" style="border: 1px solid black">6:1</td>
</tr>
<tr>
<td style="border: 1px solid black">Newsgator</td>
<td align="center" style="border: 1px solid black">X</td>
<td style="border: 1px solid black"></td>
<td align="center" style="border: 1px solid black">3:1</td>
</tr>
<tr>
<td style="border: 1px solid black">WordPress</td>
<td align="center" style="border: 1px solid black">X</td>
<td style="border: 1px solid black"></td>
<td align="center" style="border: 1px solid black">5:1</td>
</tr>
<tr>
<td style="border: 1px solid black">WordPress Comments&nbsp;</td>
<td align="center" style="border: 1px solid black">X</td>
<td style="border: 1px solid black"></td>
<td align="center" style="border: 1px solid black">2:1</td>
</tr>
<tr>
<td align="center" style="border: 1px solid black">Overall</td>
<td align="center" style="border: 1px solid black">+6  Winner!</td>
<td align="center" style="border: 1px solid black">+0  <img src='http://blog.gnip.com/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' /> </td>
<td style="border: 1px solid black"> </td>
</tr>
</table>
<p></p>
<p>We looked at one week’s worth of data across six of the top social media sources and determined that pumpkin pie “takes the cake” (so to speak) across every source. </p>
<p>In this case, it is interesting to point out that in sources like Twitter, Facebook, Google+ and WordPress we see higher winning ratios, while sources that tend to have higher latency such as Newsgator and WordPress Comments were a little more even.  Is this because, on further consideration, pecan pie sounds pretty good? Or is it that everyone will have to have two pies and, with pecan as the traditional second, it is highly discussed?</p>
<h2>Top Pie Recipes</h2>
<p>Even though pumpkin pie was our clear winner, we thought it would be fun to share a few of the most popular holiday pie recipes by social media source:</p>
<ol>
<li>Twitter &#8211; <a href="http://www.vcstar.com/news/2011/nov/12/cook-du-jour-shares-recipe-for-gluten-free-pie/">Cook du Jour Gluten-Free Pumpkin Pie</a> and <a href="http://www.youtube.com/watch?feature=player_embedded&amp;v=pvsPSDfyUwk#!">Pecan Pie Video Recipe from joyofcooking.com</a>
<li>Facebook  &#8211; <a href="http://realitytvmagazine.sheknows.com/2011/11/20/ben-starrs-pumpkin-bourbon-pecan-pie-recipe/">Ben Starr’s Pumpkin Bourbon Pecan Pie Recipe </a>
<li>Newsgator &#8211; <a href="http://www.blogher.com/snippets/pumpkin-pecan-roulade-orange-mascarpone-cream">BlogHer’s Pumpkin Pecan Roulade with Orange Mascarpone Cream Pie Recipe</a>
<li>WordPress and WordPress Comments &#8211; <a href="http://allrecipes.com/Recipe/chocolate-bourbon-pecan-pie/detail.aspx">Chocolate Bourbon Pecan Pie from allrecipes.com</a>
</ol>
<h2>Non-Traditional Thanksgiving Pies</h2>
<p>Another interesting fact that came out of this Cagefight was the counts of non-traditional Thanksgiving pies that were mentioned across the social media sources we surveyed. Though we rarely find these useful for communicating numerical values effectively, you can’t not have a pie chart in this post.</p>
<p><a href="http://blog.gnip.com/wp-content/uploads/2011/11/pie.png"><img align="center" src="http://blog.gnip.com/wp-content/uploads/2011/11/pie-1024x644.png" alt="" width="450" height="283" class="aligncenter size-large wp-image-2461" /></a></p>
<p>Happy Thanksgiving!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/gnip-cagefight-2-pumpkin-pie-vs-pecan-pie/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Launching Gnip MarketStream &amp; Partnership with StockTwits</title>
		<link>http://blog.gnip.com/launching-gnip-marketstream-partnership-with-stocktwits/</link>
		<comments>http://blog.gnip.com/launching-gnip-marketstream-partnership-with-stocktwits/#comments</comments>
		<pubDate>Thu, 17 Nov 2011 15:10:05 +0000</pubDate>
		<dc:creator>Seth McGuire, Director of Asset Management &#38; Financial Technology</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Partners]]></category>
		<category><![CDATA[Product]]></category>
		<category><![CDATA[asset management]]></category>
		<category><![CDATA[finance]]></category>
		<category><![CDATA[hedge funds]]></category>
		<category><![CDATA[launch]]></category>
		<category><![CDATA[marketstream]]></category>
		<category><![CDATA[partners]]></category>
		<category><![CDATA[stocktwits]]></category>
		<category><![CDATA[trading]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2446</guid>
		<description><![CDATA[While the market has been on its roller coaster ride across the past month, Gnip has kept its collective head down and stayed busy on behalf of our Investment Management clients (hedge funds, HFTs, asset managers, etc.). That hard work has paid off and we have two exciting announcements to make today. Launch of Gnip [...]]]></description>
			<content:encoded><![CDATA[<p>While the market has been on its roller coaster ride across the past month, Gnip has kept its collective head down and stayed busy on behalf of our Investment Management clients (hedge funds, HFTs, asset managers, etc.). That hard work has paid off and we have two exciting announcements to make today.</p>
<ul>
<li><strong>Launch of Gnip MarketStream</strong>: Our hedge fund clients have been quite vocal in their desire for a package incorporating the most relevant social media data streams into a single low-latency, high-volume solution. We’re proud to answer their needs with the launch of Gnip MarketStream, a realtime data solution that packages the incredibly rich and broad “voice of the market” Twitter stream with the uniquely deep and targeted “voice of the trader” StockTwits stream.</li>
</ul>
<ul>
<li><strong>Premium Partnership with StockTwits</strong>: An integral component of the Gnip MarketStream is StockTwits social media data. We’re thrilled to announce this partnership with StockTwits, the leading realtime financial platform for the investment community and creator of the $(TICKER) tag. The StockTwits stream is a curated, defined-demographic, realtime social data stream focused on investment decisions and analysis. Gnip now provides streaming access to the full StockTwits firehose of social data, and offers access to historical content as far back as 2009.</li>
</ul>
<p>While the use of social media data by the investment community has included use of this data in news analysis and equity research, the primary adoption of this data across the last six months has been as a trading indicator. By combining the strengths of both the Twitter stream and the StockTwits stream, Gnip MarketStream provides investment professionals unparalleled access to relevant social data at time when social media has become an increasingly vital channel for news and market sentiment.</p>
<p>For more information about Gnip MarketStream or StockTwits data, contact <a title="trading@gnip.com" href="trading@gnip.com">trading@gnip.com</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/launching-gnip-marketstream-partnership-with-stocktwits/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Boulder Chamber of Commerce: Why Gnip Joined</title>
		<link>http://blog.gnip.com/boulder-chamber-of-commerce/</link>
		<comments>http://blog.gnip.com/boulder-chamber-of-commerce/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 00:33:21 +0000</pubDate>
		<dc:creator>Jud Valeski, Co-Founder and CEO</dc:creator>
				<category><![CDATA[Corporate]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[boulder chamber]]></category>
		<category><![CDATA[boulder colorado]]></category>
		<category><![CDATA[gnip]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2429</guid>
		<description><![CDATA[It took awhile, but Gnip&#8217;s now a Boulder Chamber of Commerce (@boulderchamber) Member. We joined after a pattern of clear value to our particular industry became clear. In August of this year they hosted an event on that put us face-to-face with a the U.S. Department of Commerce Under Secretary for International Trade (Francisco Sánchez) [...]]]></description>
			<content:encoded><![CDATA[<p>It took awhile, but Gnip&#8217;s now a <a href="http://www.boulderchamber.com/">Boulder Chamber of Commerce</a> (<a href="http://twitter.com/#!/boulderchamber">@boulderchamber</a>) Member. We joined after a pattern of clear value to our particular industry became clear. In August of this year they hosted an event on that put us face-to-face with a the U.S. Department of Commerce Under Secretary for International Trade (<a href="http://trade.gov/press/bios/sanchez.asp">Francisco Sánchez</a>) and Colorado Congressman (<a href="http://polis.house.gov/">Jared Polis</a>) where we discussed software patent issues, as well as immigration visa challenges the U.S. tech industry faces. Tonight I&#8217;m attending an event with Congressman Polis and a local software Venture Capitalist (<a href="http://www.jasonmendelson.com/wp/">Jason Mendelson</a>) to talk about challenges surrounding the hiring of technical talent locally, and globally.</p>
<p>These are topics with significant political/legislative dynamics, and the Chamber has given us, a local software firm, access to relevant forums in which we can get our point of view on the table; thank you.</p>
<p>Whether or not the Chamber has been providing this kind of relevant access all along, I don&#8217;t know (my perception is otherwise). I do know that the impact they&#8217;re having on us as a local software business, as well as the channel they&#8217;re giving Gnip to get its perspective heard in the broader (National) forum, is significant. I&#8217;d encourage other Boulder software/technology firms to support their efforts, contribute in their events, and help them build an agenda that in the end, helps us be more effective software/technical businesses.</p>
<p>Join us, in joining the Chamber.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/boulder-chamber-of-commerce/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Simplicity Wins</title>
		<link>http://blog.gnip.com/simplicity-wins/</link>
		<comments>http://blog.gnip.com/simplicity-wins/#comments</comments>
		<pubDate>Thu, 10 Nov 2011 21:34:30 +0000</pubDate>
		<dc:creator>Chris Hogue, Development</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[data activities]]></category>
		<category><![CDATA[data delivery]]></category>
		<category><![CDATA[ec2]]></category>
		<category><![CDATA[elastic mapreduce]]></category>
		<category><![CDATA[emr]]></category>
		<category><![CDATA[gnip]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[jruby]]></category>
		<category><![CDATA[jvm]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[tweets]]></category>

		<guid isPermaLink="false">http://blog.gnip.com/?p=2424</guid>
		<description><![CDATA[It seems like every once in a while we all have to re-learn certain lessons. As part of our daily processing, Gnip stores many terabytes of data in millions of keys on Amazon’s S3. Various aspects of serving our customers require that we pour over those keys and the data behind them, regularly. As an [...]]]></description>
			<content:encoded><![CDATA[<p>It seems like every once in a while we all have to re-learn certain lessons.</p>
<p>As part of our daily processing, Gnip stores many terabytes of data in  millions of keys on Amazon’s S3. Various aspects of serving our customers require that we pour over those keys and the data behind them, regularly.</p>
<p>As an example, every 24 hours we construct usage reports that provide  visibility into how our customers are using our service. Are they consuming a lot or a little volume? Did their usage profile change? Are they not using us at all? So on and so on. We also have what we affectionately refer to as the “dude where’s my tweet” challenge; of the billion activities we deliver each day to our customers, inevitably someone says “hey, I didn’t receive Tweet ‘X’ what gives?” Answering that question requires that we store the ID of every Tweet a customer ever receives. Pouring over all this data every 24 hours is a challenge.</p>
<p>As we started on the project, it seemed like a good fit for Hadoop. It involves pulling in lots of small-ish files, doing some slicing, aggregate the results, and spitting them out the other end. Because we&#8217;re hosted in Amazon it was natural to use their Elastic MapReduce service (EMR).</p>
<p>Conceptually the code was straight forward and easy to understand. The logic fit the MapReduce programming model well. It requires a lot of text processing and sorts well into various stages and buckets. It was up and running quickly.</p>
<p>As the size of the input grew it started to have various problems, many of  which came down to configuration. Hadoop options, JVM options, open file limits, number and size of instances, number of reducers, etc. We went through various rounds of tweaking settings and throwing more machines in the cluster, and it would run well for a while longer.</p>
<p>But it still occasionally had problems. Plus there was that nagging feeling  that it just shouldn&#8217;t take this much processing power to do the work.  Operational costs started to pop up on the radar.</p>
<p>So we did a small test to check the feasibility of getting all the necessary files from S3 onto a single EC2 instance and processing it with standard old *nix tools. After promising results we decided to pull it out of EMR. It took several days to re-write, but we&#8217;ve now got a simple Ruby script using various *nix goodies like cut, sort, grep and their friends. The script is parallel-ized via JRuby threads at various points that make sense (downloading multiple files at once and processing the files independently once they&#8217;ve been bucketed).</p>
<p>In the end it runs in less time than it did on EMR, on a single modest  instance, is much simpler to debug and maintain, and costs far less money to run.</p>
<p>We landed in a somewhat counter-intuitive place. There’s great technology  available these days to process large amounts of data; we continue to use Hadoop for other projects. But as we start to bring them into our tool-set we have to be careful not to forget the power of straight forward, traditional tools.</p>
<p>Simplicity wins.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.gnip.com/simplicity-wins/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

