From d74a391de587befd2ebdc74ad5014a8ceec0f951 Mon Sep 17 00:00:00 2001 From: "Y. Yu" <54338793+PursuitOfDataScience@users.noreply.github.com> Date: Sat, 11 Jun 2022 16:15:17 -0400 Subject: [PATCH] Update streaming.Rmd Fixed an error. --- streaming.Rmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/streaming.Rmd b/streaming.Rmd index e4d945e..8e130a1 100644 --- a/streaming.Rmd +++ b/streaming.Rmd @@ -99,7 +99,7 @@ stream <- stream_read_text(sc, "source/") %>% stream_write_text("destination/") ``` -The streams starts running with `stream_write_*()`; once executed, the stream will monitor the _`source`_ path and process data into the _++destination /++_ path as it arrives. +The stream starts running with `stream_write_*()`; once executed, the stream will monitor the _`source`_ path and process data into the _++destination /++_ path as it arrives. We can use `stream_generate_test()` to produce a file every second containing lines of text that follow a given distribution; you can read more about this in [Appendix](#appendix). In practice, you would connect to existing sources without having to generate data artificially. We can then use `view_stream()` to track the rows per second (rps) being processed in the source, and in the destination, and their latest values over time: