Skip to content

Commit b7ab6a3

Browse files
committed
Copy-edits, cleaning up some of my failed copy-paste from the internal wiki
1 parent 10318ad commit b7ab6a3

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

_posts/2020-02-19-modernizing-an-old-data-pipeline.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ to where it is today. However, it has been in maintenance mode for some time.
3636
As a result, it's struggling to meet the needs of Scribd's fast growing
3737
engineering team,. Since [Scribd is moving more and more into the
3838
cloud](/blog/2019/migrating-kafka-to-aws.html),
39-
decided that now is a good time for us to step back and redesign the system for the
39+
we decided that now is a good time for us to step back and redesign the system for the
4040
future.
4141

4242
We need a modernized workflow orchestration system to help drastically improve
@@ -63,7 +63,7 @@ reduce coupling and speed up development cycles.
6363
**Ownership:** Today, we have dedicated engineers keeping eyes on nightly runs to
6464
notify workflow owners if anything goes wrong. The web UI doesn't some of the
6565
common maintenance actions like killing a running tasks. This, combined with
66-
lack of builtin monitoring and alerting support within the orchestration
66+
lack of built-in monitoring and alerting support within the orchestration
6767
system, means even if workflow owners want to take full ownership of their
6868
tasks, there is no easy way to accomplish it. We need to flip this around and
6969
empower workflow owners to take care of their own tasks end to end. This is the
@@ -83,9 +83,9 @@ new system.
8383

8484
**Cloud native:** As we migrate its infrastructure from a datacenter to the cloud, the
8585
new system will need to be able to run smoothly in the cloud and integrate
86-
nicely with various SASS offerings like Datadog, Pagerduty and Sentry.
86+
nicely with various software-as-a-service offerings like Datadog, Pagerduty and Sentry.
8787

88-
We basically Had two options: retrofit Datapipe or
88+
We basically had two options: retrofit Datapipe or
8989
pick a well maintained open source project as the building block. After lots of
9090
prototyping and careful evaluation, we decided to adopt [Apache Airflow](https://airflow.apache.org).
9191

@@ -108,7 +108,7 @@ Each one of the above items warrants a blog post of its own. We will be sharing
108108
what we have learned in more detail throughout this series of blog posts.
109109

110110
At Scribd, we embrace open source and try to contribute back to the community
111-
as much as we can. Since start of this internal project, we have contributed
111+
as much as we can. Since the start of this internal project, we have contributed
112112
[more than 20 patches
113113
upstream](https://github.com/apache/airflow/pulls?utf8=%E2%9C%93&q=is%3Apr+author%3Ahouqp)
114114
to Airflow including EKS support, Pagerduty hooks, many bug fixes and

0 commit comments

Comments
 (0)