이 플러그인은 최근 3개의 주요 워드프레스 출시와 시험 되지 않았습니다. 워드프레스의 좀 더 최근 버전으로 이용할 때 더 이상 관리되지 않고 지원되지 않고 호환성 문제가 있을 수 있습니다.

FeedWordPress Duplicate Post Filter

설명

This is a filter for the FeedWordPress plugin. If you don’t use FeedWordPress this plugin will not be useful to you.

I wrote this filter after seeing the same post in my database 32 times. Maybe you’ve had the same problem, and I hope this filter helps you solve it.

The filter works by hooking the “syndicated_post” action of FeedWordPress and the “save_post” action in the core of WordPress.

For each potential post from a feed, the plugin attempts to find an identical SHA1 hash of the first 1024 non-whitespace characters (stripped of HTML tags) of the post content.
If it finds a match the post is skipped. If not, the post is processed and a hash is inserted into the post’s metadata. (Stored as key _dpf.)

NOTE: No filter is going to be 100% accurate. This filter will stop most or some duplicates, but in all likelihood, will not stop all of them.
If you want a filter that stops all duplicates, this isn’t your solution. In my test installation I had 5 posted duplicates out of
125 syndicated posts (4% false negative) out of a corpus of about 300 duplicates. Most of those false negatives were due to slightly different HTML markup
in the post content itself. So my approach isn’t perfect, but it is “good enough” for me.

Important Note

This filter can only check future syndication posts. Whatever duplicates are already present in your WordPress installation you’ll have to remove/clean-up on your own.

License

Copyright (C) 2012 by Mark R. Allen
All rights reserved.

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

  • Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
  • Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
  • Neither the name of Mark Allen nor the names of any contributors may be used to endorse or promote products derived from this software without specific prior written permission.

설치

  1. Download the plugin and uncompress it.
  2. Plop it in the same directory as feedwordpress.php
  3. Enable this filter in WordPress by visiting the “Plugin” menu and activating it.

후기

이 플러그인에 대한 평가가 없습니다.

기여자 & 개발자

“FeedWordPress Duplicate Post Filter”(은)는 오픈 소스 소프트웨어입니다. 다음의 사람들이 이 플러그인에 기여하였습니다.

기여자

자국어로 “FeedWordPress Duplicate Post Filter”(을)를 번역하세요.

개발에 관심이 있으십니까?

코드 탐색하기는, SVN 저장소를 확인하시거나, 개발 기록RSS로 구독하세요.

변경이력

1.5 – Remove HTML tags before computing hash.

1.4 – Overhauled strategy for duplicate detection. This works better than previous strategies.

1.3 – Fixed a dumb typo. Rewrote the DB query using better wpdb methods (prepare and query). Changed from htmlescape2 to esc_html WP function.

1.2 – Escaping the single quote (‘) character sometimes gives a conversion of ’ instead of ’ This caused a false negative and duplicate posts would be stored. Consider this release a temporary hack until I write a proper regex based title escape function.

1.1 – Characters in post-titles like ‘&’ were not getting converted to HTML entities. This caused the post title comparison to register a false negative. (So duplicate posts would be added to your feed.)

1.0 – initial release