Contents
- 1 2016-10-01
- 2 2016-10-02
- 3 2016-10-03
- 4 2016-10-04
- 5 2016-10-05
- 6 2016-10-06
- 7 2016-10-07
- 8 2016-10-08
- 9 2016-10-09
- 10 2016-10-10
- 11 2016-10-11
- 12 2016-10-12
- 13 2016-10-13
- 14 2016-10-14
- 15 2016-10-15
- 16 2016-10-16
- 17 2016-10-17
- 18 2016-10-18
- 19 2016-10-19
- 20 2016-10-20
- 21 2016-10-21
- 22 2016-10-22
- 23 2016-10-23
- 24 2016-10-24
- 25 2016-10-25
- 26 2016-10-26
- 27 2016-10-27
- 28 2016-10-28
- 29 2016-10-29
- 30 2016-10-30
- 31 2016-10-31
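The Stream Processing Systems Monthly is a digest dedicated to collecting technical resources on stream processing systems such as Spark, Flink, Kafka, and Apex. It is completely free and updated daily; you are welcome to follow it. If any of the resources below cannot be accessed normally, use "Latest Hosts File for Accessing Google" or "Tunnello: A Free Browser Plugin for Bypassing the Firewall" to get around the block.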
2016-10-01
- [Slides] Real Time Aggregation with Kafka, Spark Streaming and ElasticSearch, scalable beyond Million RPS. Dibyendu B, InstartLogic http://events.linuxfoundation.org/sites/events/files/slides/Real%20Time%20Aggregation%20with%20Kafka%20,Spark%20Streaming%20and%20ElasticSearch%20,%20scalable%20beyond%20Million%20RPS.pdf
- [Blog] Converting csv to Parquet using Spark Dataframes http://www.dataarchitect.cloud/converting-csv-to-parquet-using-spark-dataframes
- [Talk] Indexing Time Series Streams with Samza and Druid — Gian Merlino (MetaMarkets.com) https://www.youtube.com/watch?v=bg2uPZO6Fic
- [Conference Talk] Adaptive Data Cleansing with StreamSets and Cassandra Slides: http://www.slideshare.net/DataStax/adaptive-data-cleansing-with-streamsets-and-cassandra-pat-patterson-streamsets-c-summit-2016
2016-10-02
- [Blog] Chat With Akka HTTP Websockets http://markatta.com/codemonkey/blog/2016/10/02/chat-with-akka-http-websockets/
2016-10-03
- [Blog] Understanding Kafka Consumer Groups and Consumer Lag https://www.opsclarity.com/understanding-kafka-consumer-lag/
- [Webinar] Building Real-Time Data Analytics Applications on AWS - September 2016 Webinar Series Video: https://www.youtube.com/watch?v=pHZxA3eFr-E
- [Blog] Amazon's Big Data Blog: Month in Review - September 2016 http://blogs.aws.amazon.com/bigdata/post/Tx2HXSTTCXOT52D/Month-in-Review-September-2016
- [Article] Streaming Data: Big Data at High Velocity https://www.upwork.com/hiring/data/streaming-data-high-velocity/
- [Blog] Gather insights from user events – part 1 https://amsterdam.luminis.eu/2016/10/03/gather-insights-from-user-events-part-1/
2016-10-04
- [Blog] Beginner’s Guide to Apache Flink – 12 Key Terms, Explained http://www.kdnuggets.com/2016/10/beginners-guide-apache-flink-explained.html
- [Conference Talk] Building Reactive Distributed Systems For Streaming Big Data, Analytics & Machine Learning, Helena Edelson. http://www.slideshare.net/helenaedelson/building-reactive-distributed-systems-for-streaming-big-data-analytics-machine-learning
- [Conference Talk] Riding the Jet Streams, Viktor Gamov https://reactivesummit2016.sched.org/event/7emv/riding-the-jet-streams
- [Conference Talk] Reactive Kafka with Akka Streams, Krzysztof Ciesielski https://reactivesummit2016.sched.org/event/7emi/reactive-kafka-with-akka-streams Slides: http://www.slideshare.net/slideshow/embed_code/key/Ikj7NTgl8pTmP7
- [Conference Talk] Reactive Stream processing at Netflix https://reactivesummit2016.sched.org/event/7emf/reactive-stream-processing-at-netflix Slides: http://www.slideshare.net/nickmahilani/reactive-stream-processing-with-mantis-66747882
- [Conference Talk] A brief history of time with Apache Flink. Flink Forward 2016. http://www.slideshare.net/FlinkForward/thomas-lamiraultmohamed-amine-abdessemed-a-brief-history-of-time-with-apache-flink
2016-10-05
- [Meetup] Apache Flink's Graph processing API, Gelly, Apache Flink London Meetup http://www.meetup.com/Apache-Flink-London-Meetup/events/234410049/
- [Meetup] Robust Stream Processing with Apache Flink, SF Big Analytics http://www.meetup.com/SF-Big-Analytics/events/234209595/
- [Conference Talk] Back-Pressure in Action: Handling High-Burst Workloads with Akka Streams & Kafka @PayPal https://reactivesummit2016.sched.org/event/7emk/back-pressure-in-action-handling-high-burst-workloads-with-akka-streams-kafka-paypal
- [Conference Talk] Distributed stream processing with Apache Kafka, Jay Kreps https://reactivesummit2016.sched.org/event/8B6K/distributed-stream-processing-with-apache-kafka
- [Conference Talk] Reactive Integrations with Akka Streams that Just Work! https://reactivesummit2016.sched.org/event/7emZ/reactive-integrations-with-akka-streams-that-just-work
- [Conference Talk] Implementing an akka-streams materializer for big data https://reactivesummit2016.sched.org/event/7jeY/implementing-an-akka-streams-materializer-for-big-data
- [Presentation] Moving Beyond Lambda Architectures with Apache Kudu Slides: http://www.slideshare.net/cloudera/moving-beyond-lambda-architectures-with-apache-kudu
- [Blog] Stream Processing with Kafka Streams https://www.hugopicado.com/2016/10/05/stream-processing-with-kafka-streams.html
- [Blog] How IoT and AI are driving analytics innovation http://www.iothub.com.au/news/how-iot-and-ai-is-driving-analytics-innovation-438781
- [Presentation] Apache Kafka - Spark Streaming With Brew (DevOps) https://www.youtube.com/watch?v=XoAmxOzN0l0
- [Blog] How to persist Kafka streams as JSON in No-SQL storage http://www.bigendiandata.com/2016-10-05-MaprDB-demo-with-Drill/
- [Article] Kafka Connect ReThink http://docs.datamountaineer.com/en/latest/rethink-source.html
2016-10-06
- [Meetup] Flink.TW x GCPUG.TW Co-Meetup: Apache Beam, Google Dataflow, and lots more! http://www.meetup.com/flink-tw/events/234344702/
- [Meetup] Joining the Club: Using Spark to Accelerate Big Data at Dollar Shave Club, Los Angeles Big Data Users Group http://www.meetup.com/Los-Angeles-Big-Data-Users-Group/events/233972195/
- [Meetup] Tom Kealy - Teaching computers to play games / Julien Nioche - StormCrawler. Bristech meetup, Bristol, UK. http://www.meetup.com/bristech/events/232642380/
- [Blog] Understanding Kafka Consumer Groups and Consumer Lag: Part I https://dzone.com/articles/understanding-kafka-consumer-groups-and-consumer-l
- [Slides] Functional Comparison and Performance Evaluation of Streaming Frameworks http://www.slideshare.net/HuafengWang/functional-comparison-and-performance-evaluation-of-streaming-frameworks
- [Webinar] Introduction to Hortonworks Dataflow http://hortonworks.com/webinar/introduction-hortonworks-dataflow/
- [Slides] Gearpump Materializer: enabling distributed akka streams http://www.slideshare.net/kamkasravi/gearpump-akka-streams
- [Meetup] A Deep Dive into Structured Streaming Slides: http://www.slideshare.net/databricks/a-deep-dive-into-structured-streaming-apache-spark-meetup-at-bloomberg-2016
- [News] Heron Version 0.14.4 released https://github.com/twitter/heron/releases/tag/0.14.4
2016-10-07
- [Blog] Introducing the real-time data store: Kinesis Streams https://cloudonaut.io/introducing-the-real-time-data-store-kinesis-streams/
- [Meetup] SF Big Analytics 20161005: Robust Stream Processing with Apache Flink https://www.youtube.com/watch?v=qLvej_vHVFU
- [Blog] Understanding Kafka Consumer Groups and Consumer Lag: Part II https://dzone.com/articles/understanding-kafka-consumer-groups-and-consumer-l-1
- [Podcast] Kafka Streams with Jay Kreps http://softwareengineeringdaily.com/2016/10/07/kafka-streams-with-jay-kreps/
- [Blog] Scale-out Security Event Processing for detection and analysis. https://medium.com/@henrikjohansen/scale-out-security-event-processing-for-detection-and-analysis-c756e0d188a4
- [Blog] Kafka Streams - Scaling up or down http://aseigneurin.github.io/2016/10/07/kafka-streams-scaling-up-or-down.html
- [Conference Talk] Applications in the Emerging World of Stream Processing, Neha Narkhede. GOTO 2016 Chicago Conference. Video: https://www.youtube.com/watch?v=WuBQBTET8Qg
- [Conference Talk] Asynchronous micro-services and the unified log, Crunch Conference in Budapest http://www.slideshare.net/alexanderdean/asynchronous-microservices-and-the-unified-log
2016-10-08
- [Blog] Reactive Summit 2016 Conference: Reactive Microservices and Staging Data Pipelines, Srini Penchikala https://www.infoq.com/news/2016/10/reactive-summit-day1
2016-10-09
- [Blog] Streaming with Apache Spark 2.0 https://dzone.com/articles/streaming-with-apache-spark-20
- [Conference Talk] Stream processing made easy with riko. PyCon South Africa https://www.youtube.com/watch?v=oyM3CxtCjUQ
- [Slides] Twitter's Real Time Stack - Processing Billions of Events Using Distributed Log and Heron http://www.slideshare.net/KarthikRamasamy3/twitters-real-time-stack-processing-billions-of-events-using-distributed-log-and-heron
- [Presentation] Deep Dive Into Apache Kafka, Jun Rao https://vimeo.com/185844593/77f7d239a3
2016-10-10
- [News] Bigstep Launches the First Real-Time Container Service http://www.businesswire.com/news/home/20161010005047/en/Bigstep-Launches-Real-Time-Container-Service
- [Slides] SC4 Workshop 2: Luigi Selmi : Big Data Europe Transport Pilot http://www.slideshare.net/BigData_Europe/sc4-workshop-2-luigi-selmi-big-data-europe-transport-pilot
- [Conference Talk] Micro-architectural Characterization of Apache Spark on Batch and Stream Processing Workloads. IEEE Conference on Big Data and Cloud Computing, 2016, Atlanta, USA http://www.slideshare.net/ahsanjaved_nust/microarchitectural-characterization-of-apache-spark-on-batch-and-stream-processing-workloads
- [Podcast] Reactive Streams and Microservices With Akka https://dzone.com/articles/reactive-streams-and-microservices-with-akka
- [Conference Talk] Streaming Analytics, Ashish Gupta, LinkedIn Corporation. KDD Conference 2016 Part 1 https://www.youtube.com/watch?v=L-IS5NaUVv4 Part 2 https://www.youtube.com/watch?v=c68yqYnpy1A
- [Presentation] Staging Reactive data pipelines using Kafka as the backbone http://www.datascienceassn.org/sites/default/files/Staging%20Reactive%20Data%20Pipelines%20Using%20Kafka%20as%20Backbone.pdf
2016-10-11
- [Blog] A Few of Our Favorite Insights from Flink Forward 2016 http://data-artisans.com/flink-forward-favorites-2016/
- [Blog] Connecting VoltDB to Amazon Kinesis streams https://www.voltdb.com/blog/connecting-voltdb-to-amazon-kinesis-streams
- [Blog] Transparent End-to-End security for Apache Kafka – Part 1 https://blog.codecentric.de/en/2016/10/transparent-end-end-security-apache-kafka-part-1/
- [Presentation] Training Kafka and ZooKeeper- Monitoring and Operability http://www.slideshare.net/NicolasMotte/training-kafka-and-zookeeper-monitoring-and-operability
- [Blog] Log Compaction | Highlights in the Apache Kafka and Stream Processing Community | October 2016 http://www.confluent.io/blog/log-compaction-highlights-in-the-apache-kafka-and-stream-processing-community-october-2016/
- [Conference Talk] Simplifying Logs, Events and Streams: Kafka + Rails by Terence Lee. EuRuKo 2016. https://www.youtube.com/watch?v=yl3JmF3n2bQ
- [Documentation] Cloudera Distribution of Apache Kafka http://www.cloudera.com/documentation/kafka/latest/PDF/cloudera-kafka.pdf
2016-10-12
- [Meetup] Gaining back pressure with Akka HTTP and Akka Streams + Spark 2.0 and EMR 5.0. Scala DC - MD - NOVA Meetup http://www.meetup.com/dc-scala/events/234626993/
- [Webinar] Make your Big Data ecosystem work better for you http://hortonworks.com/webinar/make-big-data-ecosystem-work-better/
- [Meetup] Streaming Technologies presented by Women in Big Data Forum https://www.meetup.com/Apache-Apex-Bay-Area/events/234670313/
- [Blog] Getting Started With Apache Flink and Kafka http://tgrall.github.io/blog/2016/10/12/getting-started-with-apache-flink-and-kafka/
- [News] Apache Flink 1.1.3 Released https://flink.apache.org/news/2016/10/12/release-1.1.3.html
- [Conference Talk] Distributed stream processing with Apache Kafka, Jay Kreps Video: https://www.youtube.com/watch?v=rXzuZb3fHHM Slides: http://www.slideshare.net/Reactivesummit/distributed-stream-processing-with-apache-kafka
- [Blog] Streaming data from Oracle using Oracle GoldenGate and Kafka Connect, Robin Moffatt http://www.confluent.io/blog/streaming-data-oracle-using-oracle-goldengate-kafka-connect/
- [Blog] StormCrawler: An Open Source SDK for Building Web Crawlers with ApacheStorm https://www.linux.com/news/stormcrawler-open-source-sdk-building-web-crawlers-apachestorm
- [News] Announcing Data Collector ver 2.1.0.0 https://streamsets.com/blog/announcing-data-collector-ver-2-1-0-0/
- [Blog] Azure Stream Analytics query testing now available in the new portal https://azure.microsoft.com/en-us/blog/stream-analytics-querytesting/
2016-10-13
- [Meetup] Distributed stream processing with Apache Kafka, Jay Kreps. Apache Kafka London Meetup http://www.meetup.com/Apache-Kafka-London/events/234143182/
- [PPT] Extending the Yahoo streaming benchmark to Apache Apex http://www.slideshare.net/DataTorrent/extending-the-yahoo-streaming-benchmark-to-apache-apex-67147894
- [Webinar] Getting to the goal of Big Data faster: Enterprise readiness for Data in motion http://hortonworks.com/webinar/getting-goal-big-data-faster-enterprise-readiness-data-motion/
- [Conference Talk] Robust Stream Processing with Apache Flink. Jamie Grier. Reactive Summit 2016 Video: https://www.youtube.com/watch?v=kyLtjuz5A0c
- [Conference Talk] Stream Processing with Apache Flink in Zalando's World of Microservice. Reactive Summit 2016 Video: https://www.youtube.com/watch?v=54_MdfHSoEY
- [Blog] Getting Started with Apache Flink and MapR Streams https://www.mapr.com/blog/getting-started-apache-flink-and-mapr-streams
- [Blog] How Kafka’s Storage Internals Work https://medium.com/the-hoard/how-kafkas-storage-internals-work-3a29b02e026
- [Conference Talk] Back-Pressure in Action: Handling High-Burst Workloads with Akka Streams & Kafka @PayPal http://www.slideshare.net/reactivesummit/backpressure-in-action-handling-highburst-workloads-with-akka-streams-kafka-paypal
- [Conference Talk] Embracing Streams…Everywhere. Reactive Summit 2016 https://www.youtube.com/watch?v=5FE6xnH5Lak
- [Conference Talk] Staging Reactive Data Pipelines Using Kafka. Reactive Summit 2016 Video: https://www.youtube.com/watch?v=lMlspFnfHM8
- [Conference Talk] Reactive Integrations with Akka Streams that Just Work https://www.youtube.com/watch?v=r3YSpH3GMlk
- [Conference Talk] Reactive Stream processing at Netflix https://www.youtube.com/watch?v=5Gk0VKuG8yY
- [Meetup] New Features in Samza 0.10.0. Navina Ramesh, LinkedIn. Video: https://www.youtube.com/watch?v=t6Zb2_ldbdE
- [Blog] Analyzing Kafka data streams with Spark https://objectpartners.com/2016/10/13/analyzing-kafka-data-streams-with-spark/
2016-10-14
- [Blog] Small scale stream processing kata. Part 1: thread pools https://www.javacodegeeks.com/2016/10/small-scale-stream-processing-kata-part-1-thread-pools.html
- [Blog] Small scale stream processing kata. Part 2: RxJava 1.x/2.x https://www.javacodegeeks.com/2016/10/small-scale-stream-processing-kata-part-2-rxjava-1-x2-x.html
- [Blog] Turning Back Time: Reprocessing Data Streams with Savepoints in Apache Flink http://data-artisans.com/turning-back-time-savepoints/
- [Podcast] Kafka Event Sourcing with Neha Narkhede http://softwareengineeringdaily.com/2016/10/14/kafka-event-sourcing-with-neha-narkhede/
- [Presentation] GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop) Slides: http://www.slideshare.net/DataTorrent/ge-iot-predix-time-series-data-ingestion-service-using-apache-apex-hadoop-67188658
- [Blog] The Do's & the Don'ts, the Ins & the Outs: Using Kafka for Data Streaming http://blog.syncsort.com/2016/10/big-data/dos-donts-ins-outs-using-kafka-data-streaming/
- [Article] Reducing the MTTD and MTTR of LinkedIn’s Private Cloud https://engineering.linkedin.com/blog/2016/10/reducing-the-mttd-and-mttr-of-linkedins-private-cloud
- [Blog] Console App 2: The Kafka Enricher https://36chambers.wordpress.com/2016/10/14/console-app-2-the-enricher/
2016-10-15
- [Blog] Apache Kafka for Item Setup, Anil Kumar https://medium.com/walmartlabs/apache-kafka-for-item-setup-3fe8f4ba5967
- [Blog] Stateful Streaming in Spark and Kafka Streams https://www.madewithtea.com/stateful-streaming-in-spark-and-kafka-streams.html
2016-10-16
- [Article] Jay Kreps on Distributed Stream Processing with Apache Kafka and Kafka Streams https://www.infoq.com/news/2016/10/reactive-summit-2016-day2
2016-10-17
- [Blog] The tale of two messaging platforms: Apache Kafka and Amazon Kinesis https://medium.com/aws-activate-startup-blog/the-tale-of-two-messaging-platforms-apache-kafka-and-amazon-kinesis-654963bdbf35
- [Conference Talk] Pulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid, Tony Ng, Strata + Hadoop World 2016 http://www.slideshare.net/tcng3716/pulsar-realtime-analytics-at-scale-with-kafka-kylin-and-druid
- [Blog] Storm vs. Heron, Part 1: Reusing a Storm topology for Heron https://otter-in-a-suit.com/blog/?p=50
- [Meetup] A Deep Dive into Spark Streaming by Tathagata Das. https://www.youtube.com/watch?v=vJkfJ2Bqyig
- [Blog] Streaming Messages from Kafka into Redshift in near Real-Time https://engineeringblog.yelp.com/2016/10/redshift-connector.html
- [Presentation] RDF Stream Processing Tutorial: RSP implementations http://www.slideshare.net/jpcik/rdf-stream-processing-tutorial-rsp-implementations
2016-10-18
- [Meetup] Apache Beam on Apache Flink, Chicago Apache Flink Meetup http://www.meetup.com/Chicago-Apache-Flink-Meetup-CHAF/events/234533511/
- [Webinar] A Paradigm Shift to Business as Usual: Real-Time Dataflows in Record Time. http://hortonworks.com/webinar/paradigm-shift-business-usual-real-time-dataflows-record-time/
- [Blog] Blink: How Alibaba Uses Apache Flink® http://data-artisans.com/blink-flink-alibaba-search/
- [Blog] To stream or not to stream http://blogs.sas.com/content/datamanagement/2016/10/18/to-stream-or-not-to-stream/
- [Blog] Complex event processing (CEP) with Apache Storm and Apache Ignite https://www.javacodegeeks.com/2016/10/complex-event-processing-cep-apache-storm-apache-ignite.html
- [News] Debezium 0.3.3 Released, Randall Hauch http://debezium.io/blog/2016/10/18/Debezium-0-3-3-Released/
2016-10-19
- [Whiteboard Walkthrough] Anomaly Detection in Telecommunications Using Complex Streaming Data https://www.mapr.com/blog/anomaly-detection-telecommunications-using-complex-streaming-data-whiteboard-walkthrough Video: https://www.youtube.com/watch?v=PM6BrL1Amyg
- [Conference Talk] New Stream Processing Platform with Apache Flink, DEVDAY2016 Video recording: https://www.youtube.com/watch?v=1hFP2Q-HL3Y Slides: http://www.slideshare.net/linecorp/b-6-new-stream-processing-platformwith-apache-flink
- [Blog] Building a Tracking Pixel in 3 Steps (featuring AWS Kinesis Firehose!) http://engineering.curalate.com/2016/10/19/building-a-tracking-pixel.html
- [Blog] Stream Processing and Lambda Architecture Challenges, Alexandre Rodrigues https://www.infoq.com/news/2016/10/stream-processing-lambda
- [Blog] Real World Use Cases of Real-Time Dataflows in Record Time http://hortonworks.com/blog/real-world-use-cases-real-time-dataflows-record-time/
- [Interview] How Apache Kafka is powering a real-time data revolution https://opensource.com/business/16/10/all-things-open-interview-neha-narkhede
- [Blog] Creating a Custom Processor for StreamSets Data Collector https://streamsets.com/blog/creating-custom-processor-streamsets-data-collector/
- [Presentation] Kafka @Scale in Cloud https://www.youtube.com/watch?v=e1VEEtAvQ9E
- [Conference Talk] Distributed stream processing with Apache Kafka http://www.slideshare.net/Reactivesummit/distributed-stream-processing-with-apache-kafka
- [Blog] Netflix Chaos Monkey Upgraded http://techblog.netflix.com/2016/10/netflix-chaos-monkey-upgraded.html
- [Meetup] Reactive integrations with Akka Streams, Scala User Group Stockholm http://www.slideshare.net/johanandren/scala-usergroup-stockholm-reactive-integrations-with-akka-streams
- [Webinar] Harnessing Data-in-Motion with Hortonworks DataFlow A Paradigm Shift to Business as Usual: Real World Use Cases of Real-Time DataFlows in Record Time by Anna Yong Product & Haimo Liu http://www.slideshare.net/hortonworks/hortonworks-data-in-motion-series-part-4
- [Article] Kafka Event Stream Modeling https://devcenter.heroku.com/articles/kafka-event-stream-modeling
- [Blog] Getting started with Druid and MapR Streams https://www.mapr.com/blog/getting-started-druid-and-mapr-streams
2016-10-20
- [Meetup] Spark Streaming and Scaling, Spark London http://www.meetup.com/Spark-London/events/234761939/
- [Video] A demo of Apache Flink with Docker on the BDE platform. Big Data Europe. https://www.youtube.com/watch?v=1zHIhFDDdCg
- [Blog] Announcing Apache Kafka 0.10.1.0 http://www.confluent.io/blog/announcing-apache-kafka-0-10-1-0/
- [Webinar] Getting started with Apache Flink using the BDE platform https://www.youtube.com/watch?v=JyUT7tipJy0
- [Blog] Beam Capability Matrix http://beam.incubator.apache.org/learn/runners/capability-matrix/
- [Blog] Testing Unbounded Pipelines in Apache Beam http://beam.incubator.apache.org/blog/2016/10/20/test-stream.html
- [Blog] StreamSets Data Collector – New Package Manager in Action https://guidoschmutz.wordpress.com/2016/10/20/streamsets-data-collector-new-package-manager-in-action/
- [Webinar] Monitoring Kafka and Understanding Consumer Lag, Allen Wang, Netflix https://www.youtube.com/watch?v=nyQF-JjXuAU
- [News] Apache NiFi 0.7.1 https://cwiki.apache.org/confluence/display/NIFI/Release+Notes#ReleaseNotes-Version0.7.1
- [Meetup] Design Patterns for working with Fast Data, Portland Java Users Group http://www.slideshare.net/MapRTechnologies/design-patterns-for-working-with-fast-data
- [Presentation] Confluent Platform 3.1 Introduction, Hans Jespersen http://www.slideshare.net/ConfluentInc/confluent-platform-31-introduction
- [Tutorial] A Practical Guide to Apache Kafka: Part 1 https://www.coshx.com/blog/2016/10/20/a-practical-guide-to-kafka1/
2016-10-21
- [Presentation] Log Ingestion: Kafka to Impala using Apex, Shubham Pathak, DataTorrent http://www.slideshare.net/DataTorrent/log-ingestion-kafka-to-impala-using-apex-67512542
- [Meetup] Intro to Apache Apex, Women in Big Data http://www.slideshare.net/DataTorrent/intro-to-apache-apex-women-in-big-data-67509833
- Introduction to Real-Time Data Processing, Yogi Devendra, DataTorrent http://www.slideshare.net/DataTorrent/introduction-to-realtime-data-processing-67515188
- [Presentation] In-Stream Processing Service Blueprint, Reference architecture for real-time Big Data applications, Anton Ovchinnikov http://www.slideshare.net/griddynamics/instream-processing-service-blueprint-reference-architecture-for-realtime-big-data-applications
- [Presentation] Spark Streaming into context http://www.slideshare.net/davidmartinezrego/spark-streaming-into-context
- [Presentation] IoT & Cortana Analytics http://www.slideshare.net/arunkumarmcsd/iot-67487153
- [Blog] Apache Storm Topology Tuning Approach for Squirrels https://community.hortonworks.com/articles/62852/feed-the-hungry-squirrel-series-storm-topology-tun.html
2016-10-22
- [Conference Talk] Streaming Engines for Big Data, Voxxed Days Thessaloniki http://www.slideshare.net/stavroskontopoulos/voxxed-days-thessaloniki-21102016-streaming-engines-for-big-data
2016-10-23
- [Blog] Developing Utility Bolts for Apache Storm http://www.river-of-bytes.com/2016/10/developing-utility-bolts-for-apache.html
2016-10-24
- [Blog] Jay Kreps on Distributed Stream Processing with Apache Kafka and Kafka Streams http://nhub.news/2016/10/jay-kreps-on-distributed-stream-processing-with-apache-kafka-and-kafka-streams/
- [Blog] Released Mocked Streams for Apache Kafka https://www.madewithtea.com/released-mocked-streams-for-apache-kafka.html
- [Blog] Apache Kafka for Real-Time Retail at Walmart Labs http://www.confluent.io/blog/apache-kafka-item-setup/
- [News] Announcing the release of Apache Samza 0.11.0 https://blogs.apache.org/samza/date/20161024
- [Blog] Flink: Worth a Second Look https://dzone.com/articles/flink-worth-a-second-look
- [Article] Kafka-Python Documentation https://media.readthedocs.org/pdf/kafka-python/master/kafka-python.pdf
2016-10-25
- [Conference Talk] The Evolution of Massive Scale Data Processing: Strata + Hadoop World NYC 2016, Tyler Akidau https://www.youtube.com/watch?v=9J_cWustI-A&feature=youtu.be
- [Meetup] Big Data Streaming Platform Ecosystem. Reza Farivar. Chicago Real-Time Streaming Analytics Meetup https://www.meetup.com/ChicagoRealTimeStreamingAnalytics/events/234676872/
- [Meetup] Apache Spark 101, Chicago Spark Users http://www.meetup.com/Chicago-Spark-Users/events/233999667/
- [Podcast] Managed Kafka with Tom Crayford http://softwareengineeringdaily.com/2016/10/25/managed-kafka-with-tom-crayford/
- [Blog] Additional Stage Libraries with dockerized StreamSets https://guidoschmutz.wordpress.com/2016/10/25/additional-stage-libraries-with-dockerized-streamsets/
- [Blog] Contributing to the StreamSets Data Collector Community https://streamsets.com/blog/contributing-streamsets-data-collector-community/
- [Meetup] Real-time Analytics on Medical Device Data with Kudu and Streamsets, Big Data Madison https://www.meetup.com/BigDataMadison/events/228352024/
- [Podcast] SE-Radio Episode 272: Frances Perry on Apache Beam http://www.se-radio.net/2016/10/se-radio-episode-272-frances-perry-on-apache-beam/
- [Blog] Les Défis du Stream Processing et de l’Architecture Lambda https://www.infoq.com/fr/news/2016/10/stream-processing-lambda
- [Blog] Kafka and Oracle databases – the new BFFs (Why are we so interested in Kafka – Part 2) http://blog.dbvisit.com/kafka-and-oracle-databases-the-new-bffs-why-are-we-so-interested-in-kafka-part-2/
- [Presentation] Multi tier, multi-tenant, multi-problem Kafka, Todd Palino, LinkedIn. http://www.slideshare.net/ToddPalino/multi-tier-multitenant-multiproblem-kafka
- [Meetup] When your stream is the system of record http://www.slideshare.net/Naveen1914/map-r-seattle-streams-meetup-oct-2016
- [Presentation] Real-Time Streaming: Intro to Amazon Kinesis, Roy Ben-Alta http://www.slideshare.net/AmazonWebServices/realtime-streaming-intro-to-amazon-kinesis-67650919
- [Blog] Confluent Streaming Platform and Syncsort Data Management: Bringing Big Data to Life, Paige Roberts https://www.confluent.io/blog/confluent-and-syncsort-forge-central-nervous-system-for-streaming-and-at-rest-data
2016-10-26
- [Blog] Setting Up an Apache Spark Powered Recommendation Engine http://www.talend.com/blog/2016/10/26/setting-up-an-apache-spark-powered-recommendation-engine
- [Webinar] How-To Guide for new features of Hortonworks Dataflow 2.0 http://hortonworks.com/webinar/guide-new-features-hortonworks-dataflow-2-0/ Slides: http://www.slideshare.net/hortonworks/webinar-series-part-5-new-features-of-hdf-5
- [Blog] Windowing data in Big Data Streams - Spark, Flink, Kafka, Akka by Adam Warski https://softwaremill.com/windowing-in-big-data-streams-spark-flink-kafka-akka/
- [Blog] Unifying Stream Processing and Interactive Queries in Apache Kafka http://www.confluent.io/blog/unifying-stream-processing-and-interactive-queries-in-apache-kafka/
- [Whiteboard Walkthrough] How to Replicate Streaming Data Across Data Centers with MapR Streams https://www.mapr.com/blog/how-replicate-streaming-data-across-data-centers-mapr-streams-whiteboard-walkthrough
- [Presentation] Real time data ingestion and Hybrid Cloud http://www.slideshare.net/nsabharwal/real-time-data-ingstion-and-hybrid-cloud
- [News] Streaming Analytics Market by Type, Applications, Vertical, Regions- Global Forecast to 2021 http://news.sys-con.com/node/3941414
- [Presentation] New Features in Samza 0.10.0, Navina Ramesh (LinkedIn) https://www.youtube.com/watch?v=t6Zb2_ldbdE
- [Presentation] Performance benchmarking for Samza, Tao Feng (LinkedIn) https://www.youtube.com/watch?v=sqGwDXGKfc4
- [Presentation] Simplifying Big Data in Apache Spark 2.0, Matei Zaharia http://www.slideshare.net/databricks/spark-summit-eu-2016-keynote-simplifying-big-data-in-apache-spark-20
- [Conference Talk] How we built an event-time merge of two kafka-streams with spark-streaming? Ralf Sigmund & Sebastian Schröder, Spark Summit Europe 2016 http://www.slideshare.net/sistar/how-we-built-an-eventtime-merge-of-two-kafkastreams-with-sparkstreaming
- [Workshop] Building a Streaming Data Platform on AWS http://www.slideshare.net/AmazonWebServices/workshop-building-a-streaming-data-platform-on-aws-67655459
2016-10-27
- [Webinar] Data Integration with Apache Kafka - 3 out of 6 http://www.confluent.io/apache-kafka-talk-series/data-integration-with-Apache-Kafka
- [Conference Talk] The Next AMPLab: Real-Time, Intelligent, and Secure Computing, Ion Stoica, Spark Summit EU http://www.slideshare.net/SparkSummit/the-next-amplab-realtime-intelligent-and-secure-computing
- [Presentation] Building Data Pipelines with Spark and StreamSets http://www.slideshare.net/metadaddy/building-data-pipelines-with-spark-and-streamsets
- [Video] Apache Flink Tutorial https://www.youtube.com/watch?v=PplPG0-njH8
- [Webinar] Real-time Data Processing using AWS Lambda https://www.youtube.com/watch?v=V10n9Uahk_M
- [Presentation] What's new in spark 2.0? http://www.slideshare.net/oluies/whats-new-in-spark-20
- [Blog] Is it time you started thinking about event stream processing? http://blogs.sas.com/content/hiddeninsights/2016/10/27/is-it-time-you-started-thinking-about-event-stream-processing/
- [Blog] Use Power BI to visualize data from an Apache Storm topology https://azure.microsoft.com/en-us/documentation/articles/hdinsight-storm-power-bi-topology/
- [Blog] “Life does not happen in batches …” introducing World of Streaming https://www.womeninbigdata.org/2016/10/27/life-does-not-happen-in-batches-introducing-world-of-streaming/
- [News] Kafka Gets Server, API Upgrades https://www.datanami.com/2016/10/27/kafka-gets-server-api-upgrades/
- [Blog] Guide to the New Features of Hortonworks DataFlow 2.0 by Haimo Liu & Bryan Bende http://hortonworks.com/blog/guide-new-features-hortonworks-dataflow-2-0/
- [Presentation] Streaming Data integration with Apache Kafka, David Tucker http://www.slideshare.net/ConfluentInc/data-integration-with-apache-kafka
- [Conference Talk] Apache NiFi 1.0 in Nutshell, Hadoop Summit Tokyo 2016 http://www.slideshare.net/KojiKawamura/apache-nifi-10-in-nutshell
- [Presentation] Online Model Updating with Spark Streaming http://www.slideshare.net/KeiraZhou2/online-model-updating-with-spark-streaming
2016-10-28
- [Presentation] Simplifying Distributed Systems Using Apache Kafka, John Holland https://objectpartners.com/2016/10/28/simplifying-distributed-systems-using-apache-kafka/ Video: https://www.youtube.com/watch?v=P51veBFwTl0 Slides: https://drive.google.com/file/d/0B-3uTdev-3jvTXhtZHpaWjMtd2c/view
- [Presentation] Create Pipeline: Real-Time Streaming and Exactly-Once Semantics with Kafka Video: https://www.youtube.com/watch?v=aofMlYiXAeQ
- [Presentation] Apache Kafka at LinkedIn - "Multi-Tier, Multi-Tenant, Multi-Problem Kafka" Video: https://www.youtube.com/watch?v=wMh6rwztT80
- [Presentation] Apache Kafka on Azure HDInsight https://www.youtube.com/watch?v=00R3ttLH9J4
- [Presentation] dtGateway - Making Apache Apex Operable http://www.slideshare.net/DataTorrent/dtgateway-making-apache-apex-operable-67809738
- [Presentation] Spark 2 http://www.slideshare.net/zaleslaw/joker16-spark-2-api-changes-structured-streaming-encoders/
- [Conference Talk] Apache NiFi 1.0 in Nutshell Slides: http://www.slideshare.net/KojiKawamura/apache-nifi-10-in-nutshell
- [Presentation] MapR Streams and Kafka https://www.youtube.com/watch?v=vc-XlCdh_F0
- [Blog] Hadoop, Storm, Samza, Spark, and Flink: Big Data Frameworks Compared https://www.digitalocean.com/community/tutorials/hadoop-storm-samza-spark-and-flink-big-data-frameworks-compared
- [Article] Containers and microservices find home in Hadoop ecosystem http://searchdatamanagement.techtarget.com/news/450401973/Containers-and-microservices-find-home-in-Hadoop-ecosystem
- [Presentation] In-Stream Processing Service Blueprint- Reference Architecture for real-time Big Data Applications, Anton Ovchinnikov http://www.slideshare.net/ceesecr/ss-67801917
- [Meetup] Real-Time Streaming Data on AWS http://www.slideshare.net/AmazonWebServices/deep-dive-and-best-practices-on-real-time-streaming-applications-nycloftoct2016
2016-10-29
- [Presentation] Streaming Analytics and Internet of Things http://www.slideshare.net/LearnWTB/streaming-analytics-and-internet-of-things-geesara-prathap
- [Presentation] Updating materialized views and caches using kafka, Zach Cox http://www.slideshare.net/zacharycox/updating-materialized-views-and-caches-using-kafka
- [Tutorial] Storing SysDig Streams with Apache NiFi https://dzone.com/articles/storing-sysdig-streams-with-apache-nifi
2016-10-30
- [Blog] Build an ETL Pipeline With Kafka Connect via JDBC Connectors https://dzone.com/articles/build-an-etl-pipeline-with-kafka-connect-via-jdbc
- [Blog] NiFi and HTTP Post Configuration, Tomas Zezula. http://www.tomaszezula.com/2016/10/30/nifi-and-http-post-configuration/
2016-10-31
- [Blog] How to Persist Kafka Data as JSON in NoSQL Storage Using MapR Streams and MapR-DB? by Ian Downard https://www.mapr.com/blog/how-persist-kafka-data-json-nosql-storage-using-mapr-streams-and-mapr-db
- [Blog] Microservices and Stream Processing Architecture at Zalando Using Apache Flink https://www.infoq.com/news/2016/10/zalando-stream-processing-flink
- [Blog] What's new in StormCrawler 1.2 http://digitalpebble.blogspot.ch/2016/10/whats-new-in-stormcrawler-12.html
Copyright of this original article belongs to 过往记忆大数据 (过往记忆); reproduction without permission is prohibited.
Link to this article: 【Big Data Stream Processing Systems Highlights Monthly (Issue 2)】(https://www.iteblog.com/archives/1818.html)