1. A suffix array is an array containing all suffixes of a string sorted in lexicographic order. It allows for efficient string searches, including prefix searches, substring searches, and near neighbor searches.
2. The suffix array can be used to find the starting positions of all occurrences of a pattern or substring within the text in time linear to the pattern length. It supports finding the longest common prefix between two strings in constant time.
3. Suffix arrays are useful for text indexing and string searching, including applications like full text search, bioinformatics, data compression, string mining, and more. They allow locating patterns in strings and calculating similarities between strings.
The document discusses using the Coq proof assistant to implement and verify algorithms in Ruby. It mentions encoding a Base64 algorithm in Coq and extracting it to OCaml, Haskell, Scheme and potentially Ruby. It provides examples of defining and proving properties of a factorial function in Coq and extracting implementations to other languages. It also discusses using Coq to model and verify other algorithms like sorting before generating Ruby implementations.
La institución educativa Doce de Octubre participa en el programa "Hacia la configuración de docentes críticos, reflexivos con sus prácticas en el aula" del MEN. Doce docentes de educación secundaria recibirán capacitación y acompañamiento de tutores en el aula. La rectora Ángela María Chaverra Sierra asistió al primer taller de formación para rectores sobre gerencia del aprendizaje institucional.
El documento presenta un proyecto de educación sexual para estudiantes de una institución educativa. El proyecto busca orientar a los estudiantes en el manejo de emociones y sentimientos, y brindarles herramientas sobre su sexualidad. Se describen los objetivos, marco legal y teórico, metodología, recursos y cronograma de actividades.
This document discusses using parallel computing in R with the snow package. It provides an overview of using snow to distribute computations across multiple CPUs. Examples are given showing how snow can be used with functions like parApply to speed up matrix multiplication by performing the operation in parallel on a cluster. The document also discusses using snow together with Rmpi and a job scheduler like Sun Grid Engine to enable parallel computing on a computing cluster.
This document discusses using Python and PyData tools for baseball analytics. It introduces Shinichi Nakagawa, a baseball analyst and Python expert. It explains common PyData tools like Grafana, Redash and Jupyter Notebook, and how they can be used to visualize and analyze baseball metrics and stats. It also discusses using Python and scraping to analyze run creation (RC) and run creation per 27 outs (RC27) stats to evaluate player and team performance.
The document discusses using the gmap tool to run Next Generation Sequencing (NGS) analysis on a Sun Grid Engine computing cluster. It describes how gmap submits NGS jobs like Tophat to the cluster using the qsub command and monitors them with qstat. It also mentions developing a Twitter bot called togofarm bot and other automated NGS and social media tools used at the Database Center for Life Science.
The document discusses using the Coq proof assistant to implement and verify algorithms in Ruby. It mentions encoding a Base64 algorithm in Coq and extracting it to OCaml, Haskell, Scheme and potentially Ruby. It provides examples of defining and proving properties of a factorial function in Coq and extracting implementations to other languages. It also discusses using Coq to model and verify other algorithms like sorting before generating Ruby implementations.
La institución educativa Doce de Octubre participa en el programa "Hacia la configuración de docentes críticos, reflexivos con sus prácticas en el aula" del MEN. Doce docentes de educación secundaria recibirán capacitación y acompañamiento de tutores en el aula. La rectora Ángela María Chaverra Sierra asistió al primer taller de formación para rectores sobre gerencia del aprendizaje institucional.
El documento presenta un proyecto de educación sexual para estudiantes de una institución educativa. El proyecto busca orientar a los estudiantes en el manejo de emociones y sentimientos, y brindarles herramientas sobre su sexualidad. Se describen los objetivos, marco legal y teórico, metodología, recursos y cronograma de actividades.
This document discusses using parallel computing in R with the snow package. It provides an overview of using snow to distribute computations across multiple CPUs. Examples are given showing how snow can be used with functions like parApply to speed up matrix multiplication by performing the operation in parallel on a cluster. The document also discusses using snow together with Rmpi and a job scheduler like Sun Grid Engine to enable parallel computing on a computing cluster.
This document discusses using Python and PyData tools for baseball analytics. It introduces Shinichi Nakagawa, a baseball analyst and Python expert. It explains common PyData tools like Grafana, Redash and Jupyter Notebook, and how they can be used to visualize and analyze baseball metrics and stats. It also discusses using Python and scraping to analyze run creation (RC) and run creation per 27 outs (RC27) stats to evaluate player and team performance.
The document discusses using the gmap tool to run Next Generation Sequencing (NGS) analysis on a Sun Grid Engine computing cluster. It describes how gmap submits NGS jobs like Tophat to the cluster using the qsub command and monitors them with qstat. It also mentions developing a Twitter bot called togofarm bot and other automated NGS and social media tools used at the Database Center for Life Science.
Presentation by David Mytton about monitoring MongoDB at the MongoSV conference 3rd Dec 2010.
A full blog series covering everything in this presentation is at http://blog.boxedice.com/mongodb-monitoring/
This document provides a summary of Taro Matsuzawa's career and activities from 1981 to 2011. It notes that he was involved with various computer-related clubs and organizations in Japan, including English language societies and Linux communities. It also discusses his work on open source projects like Mozilla and Pantomime, an open source email library for iOS.
This document discusses Python and the pandas library. It provides an overview of Python's history and advantages, such as being easy to learn and having a large standard library. It also discusses the major Python data analysis packages NumPy, SciPy, matplotlib, and pandas. Pandas allows importing data from various sources, manipulating datasets, and performing operations on labeled and indexed data. The document also covers using pandas with other tools like Spark, visualization with matplotlib, and IDEs and notebooks for Python development.
Building a queueing system in MongoDB and monitoring your cluster. Presentation by David Mytton at MongoSF May 2011 and MongoDB London User Group July 2011.
Building a queueing system in MongoDB and monitoring your cluster. Presentation by David Mytton at MongoSF May 2011 and MongoDB London User Group July 2011.
This document discusses several topics related to data visualization and analysis including:
- Different types of tree data structures like binary trees and their properties
- Clustering algorithms like Ward clustering and how they are used to group similar data points
- Voronoi diagrams and how they can be used to partition space based on distances to objects
- Google's data analysis tools like Sawzall and BigTable and how they are used to analyze large datasets
Cassandra is a decentralized, distributed database management system that provides high availability and scalability without a single point of failure. It uses a distributed hash table architecture and provides consistently fast performance even with billions of rows spread across clusters of standard servers. The document discusses Cassandra's data model, data types, APIs, and examples of companies using Cassandra including Digg, Twitter, Facebook, and Mahalo to handle large amounts of data and high query volumes at scale.
The document discusses various mathematical concepts including trigonometric functions such as sine, cosine, and tangent. It covers trigonometric identities involving addition and subtraction formulas. It also discusses exponential functions, complex exponentials, and their relationship to trigonometric functions. Further, it briefly mentions applications of mathematics including GPS and Pokémon GO.
Presentation by David Mytton about monitoring MongoDB at the MongoUK conference 21st Mar 2011.
A full blog series covering everything in this presentation is at http://blog.boxedice.com/mongodb-monitoring/
This document outlines opportunities for improving an organization's processes and professional development. It identifies several areas that could benefit from streamlining, including operations, communication practices, training programs, and performance management. Goals include enhancing efficiency, promoting collaboration, and facilitating continued learning and growth for all employees. Implementation would aim to maximize productivity while maintaining a supportive work culture.
1. The document discusses a method for tracking objects in videos by colorizing videos in a self-supervised manner without manual annotations.
2. Key steps include colorizing each frame based on a reference frame, extracting features from the colorized frames to match objects across frames, and using the matches to propagate colors and improve tracking over time.
3. The method is able to track objects as they move, rotate or change scale in videos without any object segmentations or pose annotations during training or testing.
2018년 2월 24일 KCD2018에서 Google Polymer에 대하여 발표한 내용입니다. 이 발표에서는 웹, 하이브리드 앱 및 프로그레시브 웹 앱 개발을 위한 구글의 웹컴포넌트 라이브러리인 폴리머를 쉽고 재미있게 다룹니다. 웹컴포넌트, 폴리머에 대한 소개와 함께 폴리머 2.0의 특징을 소개합니다. 또한 modulizer, TypeScript, yarn, webpack의 도입을 추진하고 있는 폴리머 3.0 알파 버전의 주요변화를 알아봅니다.
This document discusses DevOps tools and practices on Kubernetes and OpenShift container platforms. It covers topics like:
1. Using Jenkins as a service on OpenShift for continuous integration and delivery.
2. Deploying web applications and microservices on Kubernetes, including technologies like circuit breakers.
3. Architectures for distributed and microservices systems, including service meshes.
4. DevOps tools available on OpenShift like Istio for traffic management between microservices.
This document discusses a Yahoo Hack Day event on February 8, 2017 where the author worked on an iOS app called ScrollingFollowView. It lists the tools used including GitLab, Sketch, Carthage, Kingfisher, and Realm. It also mentions planning 24 hours of work over 3 days and sharing the author's GitHub and Twitter accounts.
This document discusses Ruby, a programming language. It notes that SHIBATA Hiroshi, aka hsbt, is the administrator of the tDiary development team. It asks if the reader is interested in Ruby or Rails and provides a link to a slideshow about Ruby on Rails. It also mentions recruiting for Perl Association Japan and includes links to photos related to Ruby.
The document discusses the benefits and challenges of proof-driven development using the Coq proof assistant. It describes how Coq can be used to formally prove properties about code during development. However, it also notes that Coq has limitations when dealing with large numbers, which can cause stack overflows. It also discusses using Coq to formally specify the MessagePack serialization format and prove properties about it.
Presentation by David Mytton about monitoring MongoDB at the MongoSV conference 3rd Dec 2010.
A full blog series covering everything in this presentation is at http://blog.boxedice.com/mongodb-monitoring/
This document provides a summary of Taro Matsuzawa's career and activities from 1981 to 2011. It notes that he was involved with various computer-related clubs and organizations in Japan, including English language societies and Linux communities. It also discusses his work on open source projects like Mozilla and Pantomime, an open source email library for iOS.
This document discusses Python and the pandas library. It provides an overview of Python's history and advantages, such as being easy to learn and having a large standard library. It also discusses the major Python data analysis packages NumPy, SciPy, matplotlib, and pandas. Pandas allows importing data from various sources, manipulating datasets, and performing operations on labeled and indexed data. The document also covers using pandas with other tools like Spark, visualization with matplotlib, and IDEs and notebooks for Python development.
Building a queueing system in MongoDB and monitoring your cluster. Presentation by David Mytton at MongoSF May 2011 and MongoDB London User Group July 2011.
Building a queueing system in MongoDB and monitoring your cluster. Presentation by David Mytton at MongoSF May 2011 and MongoDB London User Group July 2011.
This document discusses several topics related to data visualization and analysis including:
- Different types of tree data structures like binary trees and their properties
- Clustering algorithms like Ward clustering and how they are used to group similar data points
- Voronoi diagrams and how they can be used to partition space based on distances to objects
- Google's data analysis tools like Sawzall and BigTable and how they are used to analyze large datasets
Cassandra is a decentralized, distributed database management system that provides high availability and scalability without a single point of failure. It uses a distributed hash table architecture and provides consistently fast performance even with billions of rows spread across clusters of standard servers. The document discusses Cassandra's data model, data types, APIs, and examples of companies using Cassandra including Digg, Twitter, Facebook, and Mahalo to handle large amounts of data and high query volumes at scale.
The document discusses various mathematical concepts including trigonometric functions such as sine, cosine, and tangent. It covers trigonometric identities involving addition and subtraction formulas. It also discusses exponential functions, complex exponentials, and their relationship to trigonometric functions. Further, it briefly mentions applications of mathematics including GPS and Pokémon GO.
Presentation by David Mytton about monitoring MongoDB at the MongoUK conference 21st Mar 2011.
A full blog series covering everything in this presentation is at http://blog.boxedice.com/mongodb-monitoring/
This document outlines opportunities for improving an organization's processes and professional development. It identifies several areas that could benefit from streamlining, including operations, communication practices, training programs, and performance management. Goals include enhancing efficiency, promoting collaboration, and facilitating continued learning and growth for all employees. Implementation would aim to maximize productivity while maintaining a supportive work culture.
1. The document discusses a method for tracking objects in videos by colorizing videos in a self-supervised manner without manual annotations.
2. Key steps include colorizing each frame based on a reference frame, extracting features from the colorized frames to match objects across frames, and using the matches to propagate colors and improve tracking over time.
3. The method is able to track objects as they move, rotate or change scale in videos without any object segmentations or pose annotations during training or testing.
2018년 2월 24일 KCD2018에서 Google Polymer에 대하여 발표한 내용입니다. 이 발표에서는 웹, 하이브리드 앱 및 프로그레시브 웹 앱 개발을 위한 구글의 웹컴포넌트 라이브러리인 폴리머를 쉽고 재미있게 다룹니다. 웹컴포넌트, 폴리머에 대한 소개와 함께 폴리머 2.0의 특징을 소개합니다. 또한 modulizer, TypeScript, yarn, webpack의 도입을 추진하고 있는 폴리머 3.0 알파 버전의 주요변화를 알아봅니다.
This document discusses DevOps tools and practices on Kubernetes and OpenShift container platforms. It covers topics like:
1. Using Jenkins as a service on OpenShift for continuous integration and delivery.
2. Deploying web applications and microservices on Kubernetes, including technologies like circuit breakers.
3. Architectures for distributed and microservices systems, including service meshes.
4. DevOps tools available on OpenShift like Istio for traffic management between microservices.
This document discusses a Yahoo Hack Day event on February 8, 2017 where the author worked on an iOS app called ScrollingFollowView. It lists the tools used including GitLab, Sketch, Carthage, Kingfisher, and Realm. It also mentions planning 24 hours of work over 3 days and sharing the author's GitHub and Twitter accounts.
This document discusses Ruby, a programming language. It notes that SHIBATA Hiroshi, aka hsbt, is the administrator of the tDiary development team. It asks if the reader is interested in Ruby or Rails and provides a link to a slideshow about Ruby on Rails. It also mentions recruiting for Perl Association Japan and includes links to photos related to Ruby.
The document discusses the benefits and challenges of proof-driven development using the Coq proof assistant. It describes how Coq can be used to formally prove properties about code during development. However, it also notes that Coq has limitations when dealing with large numbers, which can cause stack overflows. It also discusses using Coq to formally specify the MessagePack serialization format and prove properties about it.
The document discusses how handshake joins could be used by soccer players to perform stream joins in a scalable and parallel manner. Handshake joins involve streams flowing past each other in opposite directions, with each core locally evaluating tuples as they pass through. This approach allows for very high degrees of parallelism across multiple cores, CPUs, and specialized hardware like FPGAs. The method uses techniques like two-phase forwarding and load balancing to process tuples in an online, symmetric, and distributed way.
This document discusses MongoDB's oplog and logging operations. It covers:
1. The createOplog function which creates the oplog as a capped collection in the local database.
2. The logOp function which logs operations to the oplog in BSON format with fields like timestamp, operation type, namespace, and operation object.
3. The _logOp and _logOpOld functions which handle actually writing to the oplog, with _logOpOld writing directly to the data file.
9. 10: i SA[i]:
7: ippi
4: issippi 10 7 4 1 0 9 8 6 3 5 2
1: ississippi T[i]:
0: mississippi
9: pi m i s s i s s i p p i
8: ppi
6: sippi
3: sissippi 6
5: ssippi T[SA[6]]
2: ssissippi → T[8]
→ “ppi”
9