什么头什么颈| 梦见苹果是什么意思| 肝胆胰脾彩超查什么病| 胰腺炎是什么病| 铜陵有什么好玩的地方| 丁香泡水喝有什么功效和作用| 手术后为什么不让睡觉| 锦纶氨纶是什么面料| 白细胞满视野是什么意思| 笑气是什么气体| 做小月子要注意什么| 失物招领是什么意思| 做梦数钱是什么意思啊| 犹太人是什么人种| 感冒了吃什么药| eu是什么元素| 儿童枕头用什么枕芯好| 梅毒阳性是什么意思| 小孩反复发烧是什么原因| 青蒿素是什么| 贫嘴是什么意思| 梦见种地是什么意思| 舌苔厚是什么原因引起的| 查hpv挂什么科| 感染幽门螺旋杆菌会出现什么症状| 肚子疼呕吐是什么原因| 太阳穴有痣代表什么| 刷单是什么意思| 手关节疼痛是什么原因| 白带异味是什么原因| 晚上为什么睡不着| 2007属什么生肖| 此言念什么| 肺栓塞是什么意思| 咳嗽吃什么好的快| 女生食指戴戒指什么意思| 9月12是什么星座| 梦见自己的哥哥死了是什么意思| 来月经可以吃什么水果| 篮球中锋是干什么的| 成人打虫吃什么药| 四月十八日是什么日子| 羊配什么生肖最好| 红薯是什么季节的| 什么舞蹈| 淋巴发炎吃什么药好| 表哥的儿子叫什么| 苍蝇吃什么食物| 双侧卵巢多囊样改变是什么意思| 台风什么时候来| 白带是黄色是什么原因| 9.9是什么星座| 什么是体制内的工作| 狗咬人后狗为什么会死| 刀纸是什么| 劈腿是什么意思| 小腿酸胀痛是什么原因| 霰粒肿用什么药| g50是什么高速| AX是什么意思| 1969年是什么年| 排卵期过后是什么期| 禁令是什么意思| 口食读什么| 不复相见什么意思| 大众什么车最贵| 海水倒灌是什么意思| 它是什么用英语怎么说| susie是什么意思| 喘息性支气管炎吃什么药| 置之死地而后生是什么意思| 老虎头衣服是什么牌子| 低钾有什么症状和危害| 头皮痛是什么原因| 清淡饮食吃什么| caring什么意思| 喉咙痛喝什么汤好| 男生什么情况想种草莓| 榆钱是榆树的什么| xo兑什么饮料好喝| 日本为什么经常地震| 总胆红素偏高是什么病| 蟑螂中药名称叫什么| scj是什么意思| 乐属于五行属什么| 同房是什么意思| 血小板下降是什么原因| 士官是什么| gap什么意思| 什么大牌护肤品好用| 常温保存是什么意思| 什么的瞬间作文| 拉稀吃什么| 孩子肚子疼吃什么药| 什么是交感神经| 什么泡水喝对肝脏好| 荸荠是什么| 堤防是什么意思| 肠道菌群失调有什么症状| 属鸡的跟什么属相最配| 浅笑安然是什么意思| 8月是什么季节| 不作为什么意思| 输卵管堵塞有什么症状| 88年的龙是什么命| 肾上腺素有什么用| 人中附近长痘痘什么原因| 清静是什么意思| 梦见自己掉头发是什么征兆| 马齿苋能治什么病| 高血压吃什么降压药| 乙酰氨基酚是什么药| 酒量越来越差什么原因| 孕妇羊水多是什么原因造成的| 玫瑰痤疮是什么原因| 女人梦见老鼠什么征兆| 直接胆红素高是什么病| 为什么早上起来血压高| 洋葱不能跟什么一起吃| 什么时候放开二胎| 属猴的守护神是什么菩萨| 北京有什么好吃的美食| faleda是什么牌子的手表| 吃核桃有什么好处和坏处| 木瓜是什么季节的水果| 血管堵塞有什么症状| 幽门螺旋杆菌的症状吃什么药| 人力资源是做什么的| 土阜念什么| 肾阴虚是什么意思| 牙龈有点发黑是什么原因| 39岁属什么| 什么是纯净物| 脖子大是什么原因| 滥竽充数的充是什么意思| 鱼腥味是什么妇科病| 什么是基因| 什么的小球| 支元体阳性是什么意思| 10点是什么时辰| 发愿是什么意思| 宠物邮寄用什么快递| 脾肾气虚的症状是什么| 10月30日什么星座| 生理是什么意思| 脑膜瘤钙化意味着什么| 喝脱脂牛奶有什么好处| 蒸鱼豉油是什么| 面黄肌瘦是什么意思| 蓝色妖姬适合送什么人| 碱性土壤适合种植什么| 点痣挂什么科| 撇嘴表情什么意思| 七九年属什么的| 无孔不入是什么意思| 九牛一毛是什么意思| 胎脂是什么原因造成的| 鹊桥是什么意思| 荧光黄是什么颜色| 消融术是什么手术| 胃热是什么原因引起的| 7月28号是什么星座| 幽门螺旋杆菌什么意思| 小龙虾吃什么| 丙氨酸氨基转移酶是查什么的| 越来越瘦是什么原因| 健身吃蛋白粉有什么好处和坏处| 口腔癌早期有什么征兆| 四百分能上什么大学| 沙漠玫瑰什么时候开花| 折耳猫什么颜色最贵| 什么样的生活| 牛大力泡酒有什么功效| 浑身发热是什么原因| d是什么元素| 神经性耳鸣有什么症状| 千里江陵是什么意思| 查心脏挂什么科| 金青什么字| 妙哉妙哉是什么意思| 美丽的近义词是什么| 高反是什么意思| 巴基斯坦是什么语言| 化学学什么| 脂肪瘤吃什么药| 大象的耳朵有什么作用| 甘油三酯高吃什么食物好| 白斩鸡是什么意思| 三点水加一个心读什么| 鸭子烧什么配菜好吃| 阴湿是什么意思| 双子座爱吃什么| 喝苹果醋有什么好处| 湿疹是什么症状| 酒后喝什么解酒| 屁臭是什么原因| 田螺小子是什么意思| 出是什么意思| 经典是什么意思| 橘子是什么季节的水果| 看手指甲挂什么科室| 5年存活率是什么意思| 卵巢早衰吃什么可以补回来| top1是什么意思| 国资委什么级别| 梦见大火是什么意思| vans是什么牌子| 肠胃不好吃什么水果比较好| 什么叫根管治疗牙齿| 咖啡喝多了有什么危害| 肌酐下降是什么原因| 克罗恩病有什么症状| 头皮痒用什么止痒最好| 一什么香蕉| 北京为什么叫帝都| 平安喜乐什么意思| 三唑酮主治什么病害| 小孩不说话什么原因| 彼岸花又叫什么花| 寂寞什么意思| 丙二醇是什么东西| 筒子骨炖什么好吃| 1893年是什么年| 护士资格证有什么用| 什么的绿毯| 粉红色泡沫痰见于什么病| 眼神迷离是什么意思| 什么是桥本甲状腺炎| 梦见什么是受孕成功了| 舌苔厚白湿气重吃什么药| 过敏性鼻炎用什么药效果好| 夏天吃什么蔬菜好| 眼疲劳用什么眼药水| 7月1日是什么星座| 切勿是什么意思| 三轮体空什么意思| 复读是什么意思| 植物神经紊乱挂什么科| 什么是av| 肺纹理增多什么意思| 鸡蛋干配什么菜炒好吃| 颈动脉彩超能查出什么| 女人绝经是什么症状| 行房时硬度不够是什么原因| 发烧拉肚子吃什么药| 肛门出血什么原因| 醋泡脚有什么好处| 饸饹是什么| 桥本氏甲状腺炎吃什么药| 入木三分是什么生肖| 辽源有什么好玩的地方| sle是什么病| 铁蛋白高是什么原因| 痰栓是什么意思| 大便是红色的是什么原因| 致癌是什么意思| 法字五行属什么| 急性荨麻疹是什么原因引起的| 心率偏高是什么原因| 樱桃跟车厘子有什么区别| 什么叫五福临门| 尿道感染吃什么药好得快| 陈宝莲为什么自杀| 泡沫尿吃什么药| 百度Jump to content

319京东路由品类日开启福利赠送模式,路由器每满199减30

From mediawiki.org
This is an archive of all technical updates for the Wikimedia Enterprise project.
百度 不过,在被告上法庭后,丸美的制造商广州佳禾承认上述宣传单中的内容表述不规范,同时该公司也表明其确系一家中日合资的化妆品企业。


2025 - Q1-Q2

[edit]

Machine Readability

[edit]
  • Goal: To include structured data into our feeds and to make unstructured Wikimedia content available in pre-parsed formats
  • Recent Launches:

Content Integrity

[edit]
  • Goal: To provide more contextual information alongside each revision to help judge whether or not to trust the revision.
  • Recent Launches:

API Usability

[edit]
  • Goal: To improve the usability of Wikimedia Enterprise APIs
  • Recent Launches:
    • Chunking snapshots feature
      • Completed to reduce max size required for snapshot downloads
      • Added: Snapshot chunking, /v2/snapshots/*/chunks, to free accounts

2024 - Q3 & Q4

[edit]

Machine Readability

[edit]
  • Goal - To include structured data into our feeds and to make unstructured Wikimedia content available in pre-parsed formats
  • Recent launches:

API Usability

[edit]
  • Goal: To improve the usability of Wikimedia Enterprise APIs
  • Recent Launches:
    • Introductory API
      • Expanded no-cost option for new users to include additional free credits
    • Chunking snapshots feature
      • Completed in Q3 2024 to reduce max size required for snapshot downloads

2024 - Q2

[edit]

Machine Readability

[edit]
  • Goal - To include structured data into our feeds and to make unstructured Wikimedia content available in pre-parsed formats
  • Launches:
    • Structured Contents snapshots: early beta release of Structured Contents Snapshots endpoint, including pre-parsed articles (abstracts, main images, descriptions, infoboxes, sections) in bulk, and covering several languages. Alongside this release, we’re also making available a Hugging Face dataset of the new beta Structured Contents snapshots and inviting the general public to freely use and provide feedback. All of the information regarding the Hugging Face dataset is posted on our blog here.
    • Beta Structured Contents endpoint within On-demand API which gives users access to our team’s latest machine readability features, including the below:
      • Short Description (available in Structured Contents On-demand)
        • A concise explanation of the scope of the page written by Wikipedia and Wikidata editors. This allows rapid clarification and helps with topic disambiguation
      • Pre-parsed infoboxes (available in Structured Contents On-demand)
        • Infoboxes from Wikipedia articles to easily extract the important facts of the topic to enrich your entities.
      • Pre-parsed sections (available in Structured Contents On-demand)
        • Content sections from Wikipedia articles to easily extract and access information hidden deeper in the page.
      • Main Image (available in all Wikimedia Enterprise APIs)
        • The main image is curated by editors to represent a given article’s content. This can be used as a visual representation of the topic.
      • Summaries (aka `abstract`) (available in all Wikimedia Enterprise APIs)
        • Easy to ingest text included with each revision to provide a concise summary of the content without any need to parse HTML or Wikitext.

Content Integrity

[edit]
  • Goal: To provide more contextual information alongside each revision to help judge whether or not to trust the revision.
  • Launches
    • Maintenance Tags
      • Key enWiki tags that point to changes in credibility.
      • Small scale POC
    • Breaking News Beta [Realtime Streaming v2]
      • A boolean field detecting breaking news events to support prioritization when doing real-time ingestion of new Wikipedia pages
    • Liftwing ‘Revertrisk’
      • ORES ‘goodfaith’ and ‘damaging’ scores have been deprecated from our API responses. We are working on the integration of ‘revertrisk’ score to our API response objects.
    • No-Index tag per revision

API Usability

[edit]
  • Goal: To improve the usability of Wikimedia Enterprise APIs
  • Launches:
    • Snapshots
      • Filtering available snapshots to group snapshots to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • On-demand
      • Cross language project entity lookups to connect different language projects for faster knowledge graph ingestion.
      • NDJSON responses to enable data consistency across WME APIs
      • Filtering and customized response payloads
    • Realtime Batch
      • Filtering available batch updates to group files to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • Realtime Streaming
      • Realtime Streaming reconnection performance improvement
      • Shared credibility signals accuracy results
      • Shared latency distribution for Realtime Streaming events
      • Parallel consumption - enable users to open multiple connections to a stream simultaneously
      • More precise tracking - empower users to reconnect and seamlessly resume message consumption from the exact point where they left off
      • Event filtering by data field/value to narrow down revisions
      • Customized response payloads to control event size
      • Proper ordering of revisions to remove accidental overwrites
      • Lower event latency to ensure faster updates
      • NDJSON responses to enable data consistency across WME APIs


2024 - Q1

[edit]

Machine Readability

[edit]
  • Goal: To include structured data into our feeds and to make unstructured Wikimedia content available in pre-parsed formats
  • Launches:
    • The Structured Contents (beta) endpoint which gives users access to our team’s latest machine readability features, including:
      • Short Description: A concise explanation of the scope of the page written by Wikipedia and Wikidata editors. This allows rapid clarification and helps with topic disambiguation.
      • Pre-parsed infoboxes to easily extract the important facts of the topic to enrich your entities.
      • Preparsed sections from Wikipedia articles to easily extract and access information hidden deeper in the page.
    • Main Image available in all Wikimedia Enterprise APIs
      • The main image is curated by editors to represent a given article’s content. This can be used as a visual representation of the topic.
    • Summaries (aka `abstract`) available in all Wikimedia Enterprise APIs:
      • Easy to ingest text included with each revision to provide a concise summary of the content without any need to parse HTML or Wikitext.

Content Integrity

[edit]
  • Goal: To provide more contextual information alongside each revision to help judge whether or not to trust the revision.
  • Launches:
    • Maintenance Tags
      • Key enWiki tags that point to changes in credibility.
      • Small scale POC
      • Breaking News Beta [Realtime Streaming v2]
        • A boolean field detecting breaking news events to support prioritization when doing real-time ingestion of new Wikipedia pages
      • Liftwing
        • ORES ‘goodfaith’ and ‘damaging’ scores have been deprecated from our API responses. We are working on the integration of ‘revertrisk’ score to our API response objects.
      • No-Index tag per revision

API Usability

[edit]
  • Goal: To improve the usability of Wikimedia Enterprise APIs
  • Launches:
    • Snapshots
      • Filtering available snapshots to group snapshots to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • On-demand
      • Cross language project entity lookups to connect different language projects for faster knowledge graph ingestion.
      • NDJSON responses to enable data consistency across WME APIs
      • Filtering and customized response payloads
    • Realtime Batch
      • Filtering available batch updates to group files to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • Realtime Streaming
      • Shared credibility signals accuracy results
      • Shared latency distribution for Realtime Streaming events
      • Parallel consumption - enable users to open multiple connections to a stream simultaneously
      • More precise tracking - empower users to reconnect and seamlessly resume message consumption from the exact point where they left off
      • Event filtering by data field/value to narrow down revisions
      • Customized response payloads to control event size
      • Proper ordering of revisions to remove accidental overwrites
      • Lower event latency to ensure faster updates
      • NDJSON responses to enable data consistency across WME APIs

2023 - Q4

[edit]

Machine Readability

[edit]
  • Goal: To include structured data into our feeds and to make unstructured Wikimedia content available in pre-parsed formats
  • Launch:
    • Sections (in Structured Contents beta endpoint)
      • Preparsed sections from Wikipedia articles to easily extract and access information hidden deeper in the page.
      • The Structured Contents (beta) endpoint which gives users access to our team’s latest machine readability features, including
      • Short Description: A concise explanation of the scope of the page written by Wikipedia and Wikidata editors. This allows rapid clarification and helps with topic disambiguation.
      • Pre-parsed infoboxes to easily extract the important facts of the topic to enrich your entities.
    • Main Image link (in Snapshots and Realtime Streaming)
      • The main image is curated by editors to represent a given article’s content. This can be used as a visual representation of the topic.
    • Summaries (aka `abstract`) available in all Wikimedia Enterprise APIs:
      • Easy to ingest text included with each revision to provide a concise summary of the content without any need to parse HTML or Wikitext.

Content Integrity

[edit]
  • Goal: To provide more contextual information alongside each revision to help judge whether or not to trust the revision.
  • Recent Launch:
    • Maintenance Tags
      • Key enWiki tags that point to changes in credibility.
      • Small scale POC
      • Slight change in schema
  • Launches:
    • Version Diffs [Realtime Streaming v2]
      • Quantitative word changes in a new revision grouped by word attributes to provide understanding of the risk of a new revision’s changes.
    • Breaking News Beta [Realtime Streaming v2]
      • A boolean field detecting breaking news events to support prioritization when doing real-time ingestion of new Wikipedia pages
    • Liftwing
      • ORES ‘goodfaith’ and ‘damaging’ scores have been deprecated from our API responses. We are working on the integration of ‘revertrisk’ score to our API response objects.
    • No-Index tag per revision

API Usability:

[edit]
  • Goal: To improve the usability of Wikimedia Enterprise APIs
  • Launches:
    • Snapshots
      • Filtering available snapshots to group snapshots to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • On-demand
      • Cross language project entity lookups to connect different language projects for faster knowledge graph ingestion.
      • NDJSON responses to enable data consistency across WME APIs
      • Filtering and customized response payloads
    • Realtime Batch
      • Filtering available batch updates to group files to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • Realtime Streaming
      • Shared credibility signals accuracy results
      • Shared latency distribution for Realtime Streaming events
      • Parallel consumption - enable users to open multiple connections to a stream simultaneously
      • More precise tracking - empower users to reconnect and seamlessly resume message consumption from the exact point where they left off
      • Event filtering by data field/value to narrow down revisions
      • Customized response payloads to control event size
      • Proper ordering of revisions to remove accidental overwrites
      • Lower event latency to ensure faster updates
      • NDJSON responses to enable data consistency across WME APIs

2023 - Q3

[edit]

Machine Readability

[edit]
  • Goal: To include structured data into our feeds and to make unstructured Wikimedia content available in pre-parsed formats
  • Launches:
    • The Structured Contents (beta) endpoint which gives users access to our team’s latest machine readability features, including:
      • Short Description: A concise explanation of the scope of the page written by Wikipedia and Wikidata editors. This allows rapid clarification and helps with topic disambiguation.
      • Pre-parsed infoboxes to easily extract the important facts of the topic to enrich your entities.
      • Main Image link (in Snapshots and Realtime Streaming
        • The main image is curated by editors to represent a given article’s content. This can be used as a visual representation of the topic.
    • Summaries (aka `abstract`) available in all Wikimedia Enterprise APIs:
      • Easy to ingest text included with each revision to provide a concise summary of the content without any need to parse HTML or Wikitext.


Content Integrity

[edit]
  • Goal: To provide more contextual information alongside each revision to help judge whether or not to trust the revision.
  • Launches
    • Version Diffs [Realtime Streaming v2]
      • Quantitative word changes in a new revision grouped by word attributes to provide understanding of the risk of a new revision’s changes.
    • Breaking News Beta [Realtime Streaming v2]
      • A boolean field detecting breaking news events to support prioritization when doing real-time ingestion of new Wikipedia pages

API Usability

[edit]
  • Goal: To improve the usability of Wikimedia Enterprise APIs
  • Launches:
    • Snapshots
      • Filtering available snapshots to group snapshots to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • On-demand
      • Cross language project entity lookups to connect different language projects for faster knowledge graph ingestion.
      • NDJSON responses to enable data consistency across WME APIs
      • Filtering and customized response payloads
    • Realtime Batch
      • Filtering available batch updates to group files to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • Realtime Streaming
      • Parallel consumption - enable users to open multiple connections to a stream simultaneously
      • More precise tracking - empower users to reconnect and seamlessly resume message consumption from the exact point where they left off
      • Event filtering by data field/value to narrow down revisions
      • Customized response payloads to control event size
      • Proper ordering of revisions to remove accidental overwrites
      • Lower event latency to ensure faster updates
      • NDJSON responses to enable data consistency across WME APIs


2023 - Q1&2

[edit]

Machine Readability

[edit]
  • Goal: To include structured data into our feeds and to make unstructured Wikimedia content available in pre-parsed formats
  • Recent Launch (in On-Demand and Realtime Batch):
    • Main Image link
      • The main image is curated by editors to represent a given article’s content. This can be used as a visual representation of the topic.
  • Launches:
    • “Summaries” available in all Wikimedia Enterprise APIs:
      • Easy to ingest text included with each revision to provide a concise description of the content without any need to parse HTML or Wikitext.

Content Integrity

[edit]
  • Goal: To provide more contextual information alongside each revision to help judge whether or not to trust the revision.
  • Active Public Beta Offerings:
    • Version Diffs [Realtime Streaming v2]:
      • Quantitative word changes in a new revision grouped by word attributes to provide understanding of the risk of a new revision’s changes.
    • Breaking News:
      • A boolean field detecting breaking news events to support prioritization when doing real-time ingestion of new Wikipedia pages

API Usability

[edit]
  • Goal: To improve the usability of Wikimedia Enterprise APIs
  • Launches:
    • Snapshots
      • Filtering available snapshots to group snapshots to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • On-demand
      • Cross language project entity lookups to connect different language projects for faster knowledge graph ingestion.
      • NDJSON responses to enable data consistency across WME APIs
      • Filtering and customized response payloads
    • Realtime Batch
      • Filtering available batch updates to group files to download
      • Parallel downloading capabilities to optimize ingestion speeds
    • Realtime Streaming
      • Event filtering by data field/value to narrow down revisions
      • Customized response payloads to control event size
      • Proper ordering of revisions to remove accidental overwrites
      • Lower event latency to ensure faster updates
      • NDJSON responses to enable data consistency across WME APIs

2022-Q4: Machine Readability POCs, Credibility Signals, and a new Realtime API feed in Beta

[edit]

New Realtime API is in closed beta:

  • As part of some of our larger infrastructural work to accommodate some of the expanding dataset needs, we
  • The beta Realtime API is a significant update and is a much more flexible event system providing:
    • Event filtering by data field/value to narrow down revisions
    • Customized response payloads to control event size
    • Proper ordering of revisions to remove accidental overwrites
    • Lower event latency to ensure faster updates
    • NDJSON responses to enable data consistency across WME APIs

Machine Readability:

  • Working out a larger roadmap but have prioritized which includes parsing out the first paragraph of Wikipedia articles (lede/summary) to add to the Wikimedia Enterprise APIs. Beginning work on this feature.

Credibility Signals:

  • We’ve released the first version of “Diffs” into a closed beta, a json payload that quantifies changes in language between two revisions. We’re testing the feature across a few popular Wikipedia languages for accuracy and usefulness.
  • Our Breaking news signal has a proof of concept. We’re testing reliability and accuracy of results on this signal that detects if new entries on Wikipedia relate to exogenous breaking news.
  • More context on this work: What are Credibility Signals?

We welcomed three new team members!

2022-Q3: Preparing the future of WME APIs

[edit]

New API Versions in the works:

  • We’re working on a new version of the WME Snapshot, Realtime, and On-demand APIs with a focus on filtering/flexibility, scalability, and the ability to more easily expand provided data signals without overloading the architecture.

Credibility Signals:

  • Francisco joined to produce a longer term roadmap of what Credibility Signals could be based on deep dive of research done over the summer. A summary of his work is to come in February 2023.

New Team Members:

  • Francisco Navas, Product Research lead for Content Integrity and Credibility Signals

2022-Q2: Self Registration and Credibility Signals

[edit]
  • Self registration:
    • Responding to feedback around accessibility, we have been working to improve the ability for individuals and companies to get started working with Wikimedia Enterprise APIs. We are building a turnkey flow to sign up and get started using our products.
    • A major goal of this access to provide the ability to work with our APIs to more interested people as well as garner more feedback to help us understand how we can tackle problems around using Wikimedia data outside of the Wikimedia ecosystem - something we have done quite a bit of qualitative research on - see Research Study below.
  • Credibility Signals:
    • In order to help Wikimedia data reusers understand what they are receiving, especially when ingesting all of the changes from a project in real time - we are creating a series of "signals", or individual data points, to help give more context to what has changed in a revision as it happens. Our first effort on this front is focused on turning changes into quantitative measures like "text differences" on new revisions. We plan to release this work into beta to try it and continue to evolve and experiment towards a better answer to some of these challenges.

2022-Q1: Release work, Uptime Monitoring, and new team members!

[edit]
  • Release work:
    • We have received an enormous amount of great feedback on phabricator and from initial users of Wikimedia Enterprise APIs that have kept us busy improving the stability of the product.
    • We have had some delays on our new architecture work and fully moving over versus prioritizing some of the new feature work on version 1.0. In the coming months, we plan to wrap the new architecture work up and release it as version 2.0.
  • Uptime Monitoring:
    • As our SLAs are a major value offering of Wikimedia Enterprise APIs, we have done quite a bit of work to improve our reliability of uptime monitoring. You can see our status page here.
  • New Team Members
    • We welcomed Haroon Shaikh to the team as our Engineering Manager. He is welcomed at an important time as we start to take in great technical feedback on our projects to triage and improve.

2021-10: Website Launch and Wikimedia Dumps release!

[edit]
  • Website Launch:
    • Our website is live! Check it out
    • Launched in this is our initial product offering details along with some pricing and sign up information.
  • Wikimedia Dumps release!
    • Wikimedia Dumps now has Wikimedia Enterprise dumps! Give it a download and please provide feedback to our team as you see relevant
    • Reminder: The Daily and Hourly Diffs are available on WMCS currently

2021-09: Launch! Building towards the next version and public access

[edit]
  • V1 launched on 9/15/2021: This month we stepped out of beta and fully launched v1 of Wikimedia Enterprise APIs. V1 APIs include:
    • Real Time:
      • Streaming: Three real time streams of all of the current events happening across our projects. You can hold this connection indefinitely and returns you the same data model as the others so that you can get all of the information in just one event object. The three streams are:
        • page-update: all revisions and changes to a page across the projects
        • page-delete: all page deletions to remove from records
        • page-visibility: highly urgent community driven events within the projects to reset
      • Batch: An API that returns a zip file containing all of changes with in a day of all "text-based" Wikimedia projects
    • Snapshot: An API that returns a zip file containing all of changes with in a day of all "text-based" Wikimedia projects
    • On-demand: An API that allows you to lookup a single page in the same JSON structure as the other endpoints.
  • Implementing new architecture:
    • We are starting to implement the architecture that we've been working on in past months to move towards a more flexible system that is built around streaming data. More information to be shared on our mediawiki page soon.
    • We are also working on rewriting some of our existing launch work into the new process - this is a lot of repurposing code but making for a stronger and more scalable system.
    • After this, we will begin the implementation of Wikidata, more credibility signals, and flexible filtering into the suite of APIs.
  • Public Access:
    • The Daily and Hourly Diffs are available on WMCS currently
    • We are planning to launch with Wikimedia Dumps soon as we launch hashing capabilities in the APIs in v1! Stay tuned.

2021-08: Roadmap Design and Building towards our September Launch!

[edit]
  • Roadmapping the next six months:
    • Wikidata:
      • Wikidata is a heavily used project by Wikimedia Enterprise's persona of commercial content reusers. Looking into the future, it is important for us to include "text-based" projects as well as Wikidata in the feeds that we create.
      • Our goal is to add Wikidata to the Firehose streams, Hourly Diffs, and Daily Exports giving Enterprise users the ability to access all of the projects (except Commons) in one API suite.
    • Credibility Signals
      • As we work to solve the challenges of reliably ingesting in real time Wikimedia data at scale, there are two big problems that still come with our data: Content Integrity and Machine Readability.
      • Wikimedia data reusers are not necessarily savvy in the nuances of the communities efforts to keep the projects as credible as possible and miss much of the context that comes with revisions that might help inform whether or not a new revision is worth replacing in an external system. This is exacerbated as reusers aim to move towards real time data on projects that are always in flux.
      • We plan to draw out the landscape of what signals can be included alongside real time and bulk feeds of new revisions to help end users add more context to their systems. Stay tuned here.
    • Flexible APIs:
      • Customizable Payload: With the ever expanding data added to our schemas, we need more flexibility on the payloads that end users would like. This is not easy or possible for Hourly Diffs or Daily Exports since those files are pre-generated and static but we aim to work on this capability across the Firehose and Structured Content APIs.
      • Enhanced Filtering: Since there are so many different data points coming through the feeds, end users will start to build their comfortability of ingestion around a few feeds. It is imperative that we provide the ability to filter beyond client side so that we can limit the direct traffic on end user's systems. This also provides a much easier user experience for users o the APIs.
  • September Launch:
    • We are all hands on deck building and processing towards our launch of our initial launch product.

2021-07: Onboarding, Architecture, and Launch Schema

[edit]
  • Added some new folks to our engineering team:
    • Welcome Prabhat Tiwary, Daniel Memije, and Tim Abdullin! They join us with each different perspectives and experiences adding substantial experience and capacity to our team.
    • With this came a lot of work stepping back and building onboarding documentation to make sure our team can grow and folks can join and contribute to our work.
  • New Architecture
    • As Wikimedia Enterprise APIs become more defined and complicated, we have started to draw out what a target architecture would look like. We are doing a lot of planning and taking time to think through what a streaming pipe should look like.
    • Our original architecture was centered around the solution of "Exports" and less around the real-time component, which in the long run will create flexibility issues with how we store and move data around our architecture.
  • Data Model / API Schema:
    • We have decided on a target schema, dataset, and set of APIs for our move out of beta in September. See more on our documentation page here


2021-06: Parsing HTML, Schema, API Organization, and Public Access

[edit]
  • Parsing HTML
    • We are entering the world of "what we can do to make the data easier to use" as we near having reliable pipes as the core of the Enterprise product.
    • First stop, parsing HTML. We are working with the Parsing team to find ways that Enterprise can support the open-source project to make parsing Parsoid HTML easier at scale for our end users.
  • Data Model / API Schema:
    • We are sending our schema work into the technical decision making process at the Wikimedia Foundation, follow on this ticket from the architecture team.
    • We have decided to adopt snake_case in our APIs as it has more flexibility with non-english languages, as we look down the line of more accessible apis.
  • Launch API Organization
    • Next week we will add to our docs page our final API name-spacing and structure for launch, we are including endpoints to quickly discern if anything has changed from project to project. Stay tuned here, I'm just typing them up in draft.
  • Public Access

2021-05: Schema, Public Access, Documentation, and Firehose

[edit]
  • Data Model / API Schema:
  • Public Access:
  • Documentation:
    • For now, we are hosting our documentation on-wiki here until we build out our larger sitemap for the Wikimedia Enterprise product. This work is in progress but feel free to watch that page for updates.
    • We are live on phabricator and all Wikimedia Enterprise related technical work is documented on our board!
  • Firehose API:
    • We have scoped the v1 release of the Firehose API and it will include filtering of Project and Page-Types (namespaces) for easier ingestion. Track progress here.
    • The Firehose will include the data from the above schema in a real time feed.

2021-04: Beta, Transparency, and Roadmap

[edit]
  • Beta Launch!:
    • The team launched a "closed beta" for our bulk and structured-content api endpoints! So far, great feedback but still working through kinks that come with a beta offering.
    • Follow this ticket for more information on when public access will be available via Wikimedia Database Dumps. Note these will be experimental, if interested in providing feedback, feel free to post on our phabricator board - we appreciate it!
    • We are finalizing a timeline with the Technical Engagement team to find how we can provide access to folks with access to their tools. Stay tuned.
  • Project transparency improvements:
    • We are moving all of Wikimedia Enterprise's project management to our Phabricator board over the next week or two.
    • We are reflecting/iterating on our open-source workflow to provide a better window into our Github push schedule for those who are interested in following along. More to come here.
  • Roadmap:
    • The next big roadmap item is refining the "data schema" work we have already done and publishing updates here. We are looking to include more contextual data to revisions as part of our ingestion feeds.

2021-03: Community conversations

[edit]
  • Refreshed documentation
    • Publication of completely refreshed documentation on MediaWiki.org and Meta. See Meta talkpage with significant amount of community feedback/comment.
  • Landing-page website
    • Launched! Incremental improvements in temporary code.
    • The website content itself is temporary and a placeholder until a fully featured page is launched alongside the product in a few months
mechrevo是什么牌子的电脑 士大夫什么意思 古代的天花是现代的什么病 空腹血糖偏高是什么原因 焦虑症是什么原因引起的
为什么会感染幽门螺旋杆菌 抗核抗体是什么 4a是什么意思 提上日程是什么意思 白带发黄是什么原因引起的
覆盆子有什么功效 微盟是做什么的 骁字五行属什么 朝圣者是什么意思 怀孕吃什么水果最好
核磁共振检查什么 女人右手断掌代表什么 舌头干燥吃什么药 白斑不能吃什么 星链是什么
手上起水泡是什么原因hcv9jop6ns6r.cn 孙悟空被压在什么山下wuhaiwuya.com 木吉他什么牌子比较好hcv7jop9ns5r.cn 眼睛疼吃什么药效果最好hcv8jop9ns2r.cn 眉毛下方有痣代表什么hcv8jop5ns7r.cn
什么是篮球基本功hcv8jop6ns8r.cn 移植后可以吃什么水果hcv9jop3ns4r.cn 流产后吃什么水果最佳hcv9jop5ns1r.cn 早上起来眼皮肿是什么原因hcv9jop0ns9r.cn 哼哈二将是什么意思hcv7jop6ns6r.cn
灌注治疗是什么意思hcv8jop9ns6r.cn 去威海玩需要准备什么hcv8jop0ns1r.cn 脚痛挂什么科hcv9jop6ns6r.cn eb病毒是什么病hcv7jop7ns2r.cn 孕妇喝什么牛奶对胎儿好aiwuzhiyu.com
神隐是什么意思hcv9jop6ns7r.cn up是什么意思weuuu.com 什么是花青素hcv8jop6ns3r.cn 发烧为什么会浑身酸疼hcv9jop2ns8r.cn 三伏天吃什么对身体好hcv8jop6ns5r.cn
百度