Skip to main content

Ultimate Google manual penguin penalty link removal guide

G = Google
Where do you start with a post penguin backlink clean up with a new client who spent most of last year removing links they built with their previous agency; and have submitted more than 8 “rejected” reconsideration requests in a year from a manual penalty? A lot of posts and talks recently have discussed this, from the start of the campaign to the execution, but how would you begin and start a failed one? Here’s my approach to establishing – ALL the URLs of your existing backlinks, and which links have been disavowed already.
This blog post is split into 4 sections:
Scenario Our new client had spent most of last year with their previous agency working on link removal for links they built and submitting 8 reconsideration requests after feeling the heat  from a manual penalty. The problem we faced was that the client had recently fired their agency which:
  1. Built them all the links deemed low quality
  2. Worked on removing the links
  3. Had a clearer picture of where they were stood in the current link removal process
Our real challenge was ensuring the next reconsideration request was actually successful and understanding:
  1. Which links that were built from the previous agency were still live?
  2. Which links that the agency built were removed?
We did have a  list of URLs that had been historically built, featuring sites like: Analysis & Outcome To summarise the results and how effective this analysis was:

Total Links Built Historically

4,171

 

Standard Method of Exporting Data & VLOOKUP

Revised Method of Exporting Data & VLOOKUP

Identified

140

717

 % of built links

3%

17%

% difference

467% increase in links identified

Essentially the conventional way to do this would be to use a VLOOKUP and comparing existing links according to your data and links built.  However what you will find is link types such as directories and social bookmark URLs do change, and through this method we found over a 400% difference in the discrepancy of links found. Tools In terms of tools, I used the following: –      Microsoft Excel –      Google Webmaster Tools –      MajesticSEO

Part 1:  Establishing The Poor Links

Step 1: Gather Your Links – (TAB NAME: Master Data) Open up an excel file and paste in all the links that you have on record that were ever built into an excel sheet: Gather your Links Step 2: Gather Links Using MajesticSEO –  (TAB NAME: Backlink Data) Use MajesticSEO and generate a backlink report to pull through all your data to include the: 1)    Fresh Data 2)    Historic Data Then download all the backlinks pointing to the domain. Majestic SEO Paste in all the data you have and be sure to insert a new row so you have on record which data set is old or new like below into a new tab: Link Remover Step 3: De-Duplicate Your Backlink Data You will find that by putting the fresh and historic data within Excel you will have duplicates, work on removing the duplicate backlinks so that only the unique values remain. De-Duplicate Backlink Data   Step 4: Gather Links Using Google Webmaster Tools Next step, go into Google Webmaster Tools and export all the links you have and click on the box that says “Download Latest Links”. Gather using Google Webmaster Paste all the links you have into a new tab within your existing excel document Step 5: De -Duplicating MajesticSEO & Google Webmaster Tool Backlinks To make sure you do not have any duplicates you can use a VLOOKUP to help you highlight which backlinks show up in MajesticSEO and Google Webmaster Tools. Vlookup Step 6: What It Should Look Like (TAB Name: Backlink Data) After filtering the data, copy and paste the unique backlinks back to the tab sheet where you had the MajesticSEO data you should have something like the screen backlink data

Part 2: Finding Existing Backlink Domain

The main problem faced is that with links such as directories, social bookmarks the original URLs do change or the webmasters change the path folder for example, the original URLs: https://www.zinglisting.com/index.php?c=392 https://your-debtmanagement-articles.com/article/17/society/ then may have changed to now: https://www.zinglisting.com/index.php?c=674 https://your-debtmanagement-articles.com/article/17/relationship/ So what can you do, to get the best picture of whether the links are still live, when outreaching for removal to webmasters? Step 1: Establishing Live Domains with Backlinks (TAB NAME: Master Data) In the tab where you have copy and pasted the historical links built,  use text to columns so that you only get the domains establishing live domains with backlinks Copy and paste the source URL into a new tab, then use text to columns so that you only get the domains and paste back into the historical links built tab PAk Hou Cheung 2 Step 2:  Finding Existing Backlink Data (TAB NAME: Backlink Data) Now go to your backlink data tab  where you have the list of Majestic and Google Webmaster tools data. PHC - BG - Pak Hou Cheung Then do the same with the source URL, and insert a new domain tab once completed to the right of column C, like below: Pak Hou Cheung - Image 9 Step 3: Establishing The Live Link Domains (TAB NAME: Master Data) Now go back to your Master data tab and do another VLOOKUP, and what this shows you is a clear picture of the domains that are still live and pointing to your site.

Part 3: Establishing The Location of URL in a Domain

Now you know the existing domains that were historically built and  are still live based on the vlookup data.  How do you locate the links on page to let the webmasters know where the links exist on their site and asked to be removed?   Essentially the next steps will show you data: 1) The number of historical links that were built and are still live 2) The exact URL location of the links that were built historically that may have been changed Step 1: Create a Copy of the Backlink Data Tab Make a copy of your backlink data tab. Now flip the data in Column C Source URL with column B Domains. You need to do this so that the next steps mean that the VLOOPUP can read the data correctly. Old Tab Vs New Tab - Link Removal Pak Hou Step 2: Locating The Live Backlink URLs (TAB NAME: Master Data Tab) Now go to the Master Data Tab:
  1.  Create a new column to the right of domain tab
  2. Create a new VLOOKUP based on the Copy of the Backlink Data Tab
Image 2 And now within this data shows you: 1) Overall picture of live links that were historically built 2) URL location of the live links that were historically built

Part 4: Understand What’s Been Disavowed?

With many websites who have been hit by a penalty they will inevitably Disavow websites that have be unable to remove, this final part will help you to understand: 1)  Websites that have been Disavowed. 2)  Show to Google search quality team that you’re still making a conscious effort of removing links that are still live. Step 1:  Export Disavowed Websites Go to the Disavow Links Tool page and export the data Disavow Tool FIle - Pak Hou Cheung   Step 2:  Identify Disavowed Websites 1)  Create a new tab within your existing excel document, and paste in your Disavow domains. 2)  Go to your Master Data Tab and create a new column and have the root domain in one column Pak Hou Cheung 2 - Link Remoove 3)   Then use a VLOOKUP to help you identify which root domains have been Disavowed to give you a clearer picture of the websites. Pak Hou Cheung - Disavowed Websites - Google PHC Finally you’re done and essentially you now have a clear picture of: 1) Which links exist 2) Which links to remove 3) Which domains that have been Disavowed 4) Which domains have not been Disavowed Results

Total Links Built Historically

4,171

 

Standard Method of Exporting Data & VLOOKUP

Revised Method of Exporting Data & VLOOKUP

Identified

140

717

 % of built links

3%

17%

% difference

467% increase in links identified

In Conclusion….. With any new campaign you start on it’s always important to understand where you stand and not rush into anything new, just because everyone else is doing it.   By cleaning up on your existing problems, moving forward this gives you the best platform to build from. What are your thoughts on this method of identifying links in this scenario?  Be great to hear your ideas and what you think about this too!