Cargando…

Case study : how Pinterest built a stream processing platform with Apache Flink.

Facing rapid growth and competition in its online business, Pinterest tech stacks demanded large-scale stateful online processing technology to unlock multiple top initiatives. From lambda functions like micro services to Kafka Stream and Spark streaming, Pinterest explored all of these options and...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Formato: Electrónico Video
Idioma:Inglés
Publicado: [Place of publication not identified] : O'Reilly Media, Inc., 2022.
Edición:[First edition].
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cgm a22000007i 4500
001 OR_on1299175189
003 OCoLC
005 20231017213018.0
006 m o c
007 vz czazuu
007 cr cnannnuuuuu
008 220224s2022 xx 059 o vleng d
040 |a ORMDA  |b eng  |e rda  |e pn  |c ORMDA  |d OCLCO  |d OCLCF 
035 |a (OCoLC)1299175189 
037 |a 0636920672371  |b O'Reilly Media 
050 4 |a TK5105.386 
082 0 4 |a 006.7/876  |2 23 
049 |a UAMI 
245 0 0 |a Case study :  |b how Pinterest built a stream processing platform with Apache Flink. 
250 |a [First edition]. 
264 1 |a [Place of publication not identified] :  |b O'Reilly Media, Inc.,  |c 2022. 
300 |a 1 online resource (1 video file (59 min.)) :  |b sound, color. 
306 |a 005900 
336 |a two-dimensional moving image  |b tdi  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
344 |a digital  |2 rdatr 
347 |a video file  |2 rdaft 
380 |a Instructional films  |2 lcgft 
511 0 |a Host: David Cates ; presenter, Chen Qin. 
511 0 |a Host, 
520 |a Facing rapid growth and competition in its online business, Pinterest tech stacks demanded large-scale stateful online processing technology to unlock multiple top initiatives. From lambda functions like micro services to Kafka Stream and Spark streaming, Pinterest explored all of these options and decided to go with a single open source stream processing platform, powered by Apache Flink. Starting from a legacy batch-only data stack and varying levels of expertise among the user team, Pinterest's stream processing platform has evolved in multiple stages into a reliable platform that enables data-driven products and timely decision making. In the first stage, facing the demand of running large stateful applications, we built connectors that worked with Pinterest infrastructure, and streamlined user education with embedded training to fine tune each use case. In the next stage, we onboarded ads real-time ingestion which predict user match with 50% + accuracy in near real time. We also onboarded tier 1 advertiser spend unified calculation that has very stringent SLO in a very tight timeline. As the platform grew, we had to build a more scalable way to offer non-stream processing or infrastructure background engineered to unblock real-time machine learning use cases. Thus, reprocess, backfill source, live debugging automation, verification, and deployment automation, as well as a dependency injection based job editor, were built into the platform and became widely adopted, empowering a large number of jobs to production in a relatively small amount of time. Eventually, features like unified shopping catalog indexing, content safety and trust, as well as deduplicate 4b+ images on the platform in near real-time were built and launched. And with additional investment in technology and ecosystem teams, Pinterest's stream processing platform is accelerating transforming machine learning use cases, online processing, and data warehousing into near real-time. This case study reviews how Pinterest chose Apache Flink as the technology behind its stream processing platform, how the platform enabled critical use cases and a user base that scaled out and evolved along with product innovation, and lessons learned in implementing and developing this platform. What you will learn--and how to apply it By the end of this case study the viewer will understand: How and why Pinterest chose Apache Flink over other stream processing offerings How Pinterest uses stream processing to unlock business values and what it built to enable those accomplishments How the Pinterest stream processing team evolved and grew its offerings to scale out in alignment with business needs in different phases And the viewer will be able to: Kick start building a stream processing platform Evaluate use case fits in stream processing Avoid pitfalls Pinterest experienced in growing its stream processing platform This case study is for you if... You're a data infrastructure team lead or manager who is interested in implementing stream processing offerings in your organization You're a business executives or technology leader who is looking to add value and transform current ETL based data warehousing to near real-time Recommended follow-up: Unified Flink Source at Pinterest: Streaming Data Processing (article) Detecting Image Similarity in (Near) Real-time Using Apache Flink (article) Pinterest Visual Signals Infrastructure: Evolution from Lambda to Kappa Architecture (article) Real-time experiment analytics at Pinterest using Apache Flink (article). 
588 0 |a Online resource; title from title details screen (O'Reilly, viewed February 24, 2022). 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Streaming technology (Telecommunications) 
650 0 |a Big data. 
650 2 |a Webcasts as Topic 
650 6 |a En continu (Télécommunications) 
650 6 |a Données volumineuses. 
650 7 |a Big data.  |2 fast  |0 (OCoLC)fst01892965 
650 7 |a Streaming technology (Telecommunications)  |2 fast  |0 (OCoLC)fst01134637 
655 2 |a Webcast 
655 7 |a Instructional films.  |2 fast  |0 (OCoLC)fst01726236 
655 7 |a Internet videos.  |2 fast  |0 (OCoLC)fst01750214 
655 7 |a Nonfiction films.  |2 fast  |0 (OCoLC)fst01710269 
655 7 |a Instructional films.  |2 lcgft 
655 7 |a Nonfiction films.  |2 lcgft 
655 7 |a Internet videos.  |2 lcgft 
655 7 |a Films de formation.  |2 rvmgf 
655 7 |a Films autres que de fiction.  |2 rvmgf 
655 7 |a Vidéos sur Internet.  |2 rvmgf 
700 1 |a Cates, David,  |e host. 
700 1 |a Qin, Chen,  |e presenter. 
710 2 |a O'Reilly (Firm),  |e publisher. 
856 4 0 |u https://learning.oreilly.com/library/view/~/0636920672371/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
994 |a 92  |b IZTAP