Posts Tagged ‘Publishers’

Posted in APIs,Partners,Publishers by Eric No Comments

We always get some interesting requests for doing additional processing on data sources.   Some of these are addressed using Gnip filters, but others do not really fit the filter model.   In able to support richer or more complex data processsing we have built some additional features into the Gnip platform.   The first new publisher using some of these new features is “Digg-2000“.

Digg-2000 Publisher — What is it?

Lots of people submit stories to Digg and lots of other people Digg the stories which allows more popular information to rise to the top in being discoverable.    Several Gnip customers asked if we could make it possible for them to only receive stories that had a specific number of Diggs.   We asked Digg about the idea and they said it sounded great since they have a Twitter account that provides a similar type of feature, so the Digg-2000 Gnip publisher was born.

Digg-2000 Publisher — How it works?

On the Gnip platform we have set up a publisher that is listening to activities on Digg.

  • When new stories are submitted to Digg we pick those new activites up along with the Digg count on the story and they are posted to the standard Gnip Digg publisher.
  • With every new Digg on a story we increment our tracking of the story and when we hit 2000 diggs we re-post the original story to the Digg-2000 publisher.
  • The configuration of the Digg-2000 publisher allows for us to turn two different dials.
    • The default configuration only will re-post stories that were first posted on Digg in the last two days.   This means we are looking for current active stories and not stories that were posted months ago and through a slow and steady interest finally hit 2000 Diggs.
    • The default configuration re-posts at 2000 Diggs.   This can be set to any number of Diggs — 100, 1000, 2000, etc.

Posted in APIs,Industry,Partners by Eric No Comments

Welcome PostRank!

Today Gnip and PostRank announced a new partnership (blog) (press release)that allows companies using the Gnip platform to access the nearly 3 million news articles and stories indexed by PostRank from a million discrete sources every day.  In addition PostRank collects the real-time social interactions with content across dozens of social networks and applications.

Gnip will be providing PostRank as a premium service that requires a subscription.  All of the value added features of the Gnip platform including normalization, rule-based filtering, and push delivery of content are supported with the new Gnip PostRank Data Publisher.   For pricing information contact info@gnip.com or shane@gnip.com

Posted in APIs,Customers,Publishers,Strategy,solutions by Eric No Comments

Obviously we have some understanding on the concepts of pushing and polling of data from service endpoints since we basically founded a company on the premise that the world needed a middleware push data service.    Over the last year we have had a lot of success with the push model, but we also learned that for many reasons we also need to work with services via a polling approach.   For this reason our latest v2.1 includes the Gnip Service Polling feature so that we can work with any service using push, poll or a mixed approach.

Now, the really great thing for users of the Gnip platform is that how Gnip collects data is mostly abstracted away.   Every end user developer or company has the option to tell Gnip where to push data that you have set up filters or have a subscription.   We also realize not everyone has an IT setup to handle push so we have always provided the option for HTTP GET support that lets people grab data from a Gnip generated URL for your filters.

One place where the way Gnip collects data can make a difference, at this time, for our users is the expected latency of data.  Latency here refers to the time between the activity happening (i.e. Bob posted a photo, Susie made a comment, etc) and the time it hits the Gnip platform to be delivered to our awaiting users.     Here are some basic expectation setting thoughts.

PUSH services: When we have push services the latency experience is usually under 60 seconds, but we know that this is not always the case sense sometimes the services can back-up during heavy usage and latency can spike to minutes or even hours.   Still, when the services that push to us are running normal it is reasonable to expect 60 second latency or better and this is consistent for both the Community and Standard Edition of the Gnip platform.

POLLED services:   When Gnip is using our polling service to collect data the latency can vary from service to service based on a few factors

a) How often we hit an endpoint (say 5 times per second)

b) How many rules we have to schedule for execution against the endpoint (say over 70 million on YouTube)

c) How often we execute a specific rule (i.e. every 10 minutes).     Right now with the Community edition of the Gnip platform we are setting rule execution by default at 10 minute intervals and people need to have this in mind with their expectation for data flow from any given publisher.

Expectations for POLLING in the Community Edition: So I am sure some people who just read the above stopped and said “Why 10 minutes?”  Well we chose to focus on “breadth of data ” as the initial use case for polling.   Also, the 10 minute interval is for the Community edition (aka: the free version).   We have the complete ability to turn the dial and use the smarts built into the polling service feature we can execute the right rules faster (i.e. every 60 seconds or faster for popular terms and every 10, 20, etc minutes or more for less popular ones).    The key issue here is that for very prolific posting people or very common keyword rules (i.e. “obama”, “http”, “google”) there can be more posts that exist in the 10 minute default time-frame then we can collect in a single poll from the service endpoint.

For now the default expectation for our Community edition platform users should be a 10 minute execution interval for all rules when using any data publisher that is polled, which is consistent with the experience during our v2.1 Beta.    If your project or company needs something a bit more snappy with the data publishers that are polled then contact us at info@gnip.com or contact me directly at shane@gnip.com as these use cases require the Standard Edition of the Gnip platform.

Current pushed services on the platform include:  WordPress, Identi.ca, Intense Debate, Twitter, Seesmic,  Digg, and Delicious

Current polled services on the platform include:   Clipmarks, Dailymotion, deviantART, diigo, Flickr, Flixster, Fotolog, Friendfeed, Gamespot, Hulu, iLike, Multiply, Photobucket, Plurk, reddit, SlideShare, Smugmug, StumbleUpon, Tumblr, Vimeo, Webshots, Xanga, and YouTube

Posted in APIs,Partners,Publishers by Eric No Comments

We are pleased to be announce an agreement with Automattic, Inc. that allows us to add WordPress.com as our newest data publisher in the standard edition of the Gnip platform.

Gnip now provides access to the WordPress XMPP firehose for posts and comments.   The WordPress.com firehose is designed for companies who would like to ingest a real-time stream of new WordPress.com posts and comments the second they get published and access is via subscription only.   For more information contact Gnip at info@gnip.com

Posted in APIs,Publishers,Release by Eric No Comments

After running what we believe has been a very complete beta program for the last three months we are ready to officially launch our 2.1 version next week at the end of the day Tuesday, May 12th.

What will happen on May 12th

  1. v2.1 of the Gnip platform available at http://api.gnip.com will become the officially supported version.   Existing customers of the standard version of our product are all being contacted directly via email.   Community version users are being notified by our official newsletter, this blog post and our standard practice of posting to our Twitter account @gnipsupport.
  2. Version 2.0 will be deprecated and continue to be available for 30 days.  Existing users of the http://prod.gnipcentral.com version of the service are encouraged to move to the new version as soon as possible.   The point of the 3 month beta program was to provide time to upgrade to the new Gnip v2.1 data schema.  Read up on the new version at http://www.gnip.com/docs

Posted in APIs,Publishers,Release by Eric No Comments

We just finished making a major upgrade to the beta api.gnip.com environment.  First, thank you to everyone for their patience during our middle of the day upgrade.  We normally schedule upgrades off hours or do a rolling upgrade, but tonight the entire team of ten is going to the Star Trek premiere.   Anyway, we made the “management” decision to do the upgrade earlier in the day so we did not run into our company/family event tonight.

What’s new?   Up until now the data publishers in api.gnip.com have been doing lazy scheduling, which means they would pull data but it was actually easy with a small number of rules to miss data or not have a filter get a hit in our scheduling.  Yeah, that is a beta thing as the primary reason for having the beta out for so long was giving users a chance to get all their existing integrations moved to the new schema.   With Beta 3 we have made some major enhancements system that from what we see in our tests greatly improve the amount of data flow across all our data publishers using the new polling services.

From a timeline standpoint we want to let the new features soak a bit and then we will lock down a date to take the system to production as early as the end of May.     In the next few days we are running diagnostics and scaling out the system by putting it through the paces, so feel free to do the same or just work with our notification streams on any given publisher.

Live long and prosper.  (sorry, just could not resist the Star Trek line)

Posted in APIs,Publishers,Release by Eric No Comments

We continue to push out new publishers to the beta http://api.gnip.com environment as we work to finish up the release and get the final touches on lots of new features.

The new publishers this week include the following:

  • FriendFeed-search:  Supports the KEYWORD rule-type and works with the standard FriendFeed Search interface for tracking conversations
  • Hulu: Supports the ACTOR rule-type and works with the standard Hulu interface for tracking conversations
  • Hulu-search: Supports the KEYWORD rule-type and works with the standard Hulu Search interface
  • YouTube: Supports the ACTOR and TAG rule-types and works witih the standard YouTube interface and tracks “uploads”
  • YouTube-search: Supports the KEYWORD rule type and works witih the standard YouTube-search interface

Ok, now go grab some data from these or any of our other now 20+ data publishers in the system.   Or read up on the new features in http://www.gnip.com/docs

continue reading…

Posted in APIs,Customers,Publishers,Release by Eric No Comments

Next time  you login to the Gnip Developer Site at http://api.gnip.com you will notice that we made some tweaks to the layout and design.     The prior site was first pulled together right before the v1.0 went live last summer and it had stayed the same up until this update.  (yeah, we should have updated it sooner….)

In the updated design we added more explanations of where to start for newer users, where to go for information and cleanup up the interface.   We are planning a more complete update in the future and at that time will add more functionality so if there are wish lists out there please do let us know.

gnip_developer_site_new_mar_09

Posted in APIs,Publishers by Eric No Comments

With our schema now finalized and in beta at http://demo.gnip.com and the crowd-sourcing application launched to help us prioritize our publisher integration schedule the team is now heads down building out more publishers on the Gnip platform.

Today we put nine ten new publishers into demo.gnip.com.   All of these are using the updated schema and provide support for notifications and activities with full-data.  Have fun integrating some data!

  1. Delicious
  2. Fotolog
  3. Plurk
  4. Reddit
  5. Slideshare (added after original blog post)
  6. Stumbleupon
  7. Tumblr
  8. Twitter-search
  9. Vimeo
  10. Webshots

Posted in APIs,Publishers by Eric No Comments

Now that we have the beta of the new Gnip schema up in the new demo system we are ready to roll out something we hope everyone in our partner and user community will be excited to participate.

Today we have turned on a new web application that we are hosting at UserVoice, who specializes in hosting customer feedback forums.   The new Gnip Forum provides anyone and everyone the chance to tell us the social media and business services that we should integrate to and the priority of integration.

You heard us, we want you to tell us what to do!   Just go to http://gnip.uservoice.com.  What you will find is a list that as of right now includes 352 different services in priority order based on votes for those services.  Just create an account and decide how to allocate your votes. Also, if there is a service we did not include feel free to add the service and tell us the URL and it will be added automatically.

The list also includes status labels that will allow people to track our progress.    There are a few new services in progress right now and after we complete the current beta on the new schema we expect to be adding publishers at a good clip.